PHYSICAL FOUNDATIONS OF COSMOLOGY


Inflationary cosmology has been developed over the last 20 years to remedy serious 
shortcomings in the standard hot big bang model of the universe. Taking an original
approach, this textbook explains the basis of modern cosmology and shows where 
the theoretical results come from. 

The book is divided into two parts: the first deals with the homogeneous and 
isotropic model of the universe, while the second part discusses how initial inhomo- 
geneities can explain the observed structure of the universe. Analytical treatments 
of traditionally highly numerical topics — such as primordial nucleosynthesis, re- 
combination and cosmic microwave background anisotropy — are provided, and 
inflation and quantum cosmological perturbation theory are covered in great de- 
tail. The reader is brought to the frontiers of current cosmological research by the 
discussion of more speculative ideas. 

This is an ideal textbook both for advanced students of physics and astrophysics 
and for those with a particular interest in theoretical cosmology. Nearly every 
formula in the book is derived from basic physical principles covered in undergrad- 
uate courses. Each chapter includes all necessary background material and no prior 
knowledge of general relativity and quantum field theory is assumed. 


VIATCHESLAV MUKHANOV is Professor of Physics and Head of the Astroparticle
Physics and Cosmology Group at the Department of Physics,
Ludwig-Maximilians-Universität München, Germany. Following his Ph.D. at the Moscow
Physical-Technical Institute, he conducted research at the Institute for Nuclear 
Research, Moscow, between 1982 and 1991. From 1992, he was a lecturer at 
Eidgenössische Technische Hochschule (ETH) in Zürich, Switzerland, until his
appointment at LMU in 1997. His current research interests include cosmic microwave
background fluctuations, inflationary models, string cosmology, the cosmological 
constant problem, dark energy, quantum and classical black holes, and quantum 
cosmology. He also serves on the editorial boards of leading research journals in 
these areas. 

In 1980-81, Professor Mukhanov and G. Chibisov discovered that quantum 
fluctuations could be responsible for the large-scale structure of the universe. They 
calculated the spectrum of fluctuations in a model with a quasi-exponential stage 
of expansion, later known as inflation. The predicted perturbation spectrum is in 
very good agreement with measurements of the cosmic microwave background 
fluctuations. Subsequently, Professor Mukhanov developed the quantum theory 
of cosmological perturbations for calculating perturbations in generic inflationary 
models. In 1988, he was awarded the Gold Medal of the Academy of Sciences of 
the USSR for his work on this theory. 
Foreword by Professor Andrei Linde


Since the beginning of the 1970s, we have witnessed spectacular progress in the 
development of cosmology, which started with a breakthrough in the theoretical 
understanding of the physical processes in the early universe and culminated in a se- 
ries of observational discoveries. The time is ripe for a textbook which summarizes 
the new knowledge in a rigorous and yet accessible form. 

The beginning of the new era in theoretical cosmology can be associated with the 
development of the gauge theories of weak, electromagnetic and strong interactions. 
Until that time, we had no idea of properties of matter at densities much greater than 
nuclear density ~ $10^{14}$ g/cm$^3$, and everybody thought that the main thing we need
to know about the early universe is the equation of state of superdense matter. In the 
beginning of the 1970s we learned that not only the size and the temperature of our 
universe, but also the properties of elementary particles in the early universe were 
quite different from what we see now. According to the theory of the cosmological 
phase transitions, during the first $10^{-10}$ seconds after the big bang there was not
much difference between weak and electromagnetic interactions. The discovery of 
the asymptotic freedom for the first time allowed us to investigate the properties of 
matter even closer to the big bang, at densities almost 80 orders of magnitude higher 
than the nuclear density. Development of grand unified theories demonstrated that 
baryon number may not be conserved, which cleared the way towards the theoretical 
description of the creation of matter in the universe. This in its turn opened the 
doors towards inflationary cosmology, which can describe our universe only if the 
observed excess of baryons over antibaryons can appear after inflation. 

Inflationary theory allowed us to understand why our universe is so large and flat, 
why it is homogeneous and isotropic, why its different parts started their expansion 
simultaneously. According to this theory, the universe at the very early stages of 
its evolution rapidly expanded (inflated) in a slowly changing vacuum-like state, 
which is usually associated with a scalar field with a large energy density. In the 
simplest version of this theory, called “chaotic inflation,” the whole universe could
emerge from a tiny speck of space of a Planckian size $10^{-33}$ cm, with a total mass
smaller than 1 milligram. All elementary particles surrounding us were produced 
as a result of the decay of this vacuum-like state at the end of inflation. Galaxies 
emerged due to the growth of density perturbations, which were produced from 
quantum fluctuations generated and amplified during inflation. In certain cases, 
these quantum fluctuations may accumulate and become so large that they can 
be responsible not only for the formation of galaxies, but also for the formation 
of new exponentially large parts of the universe with different laws of low-energy 
physics operating in each of them. Thus, instead of being spherically symmetric and 
uniform, our universe becomes a multiverse, an eternally growing fractal consisting 
of different exponentially large parts which look homogeneous only locally. 

One of the most powerful tools which can be used for testing the predictions 
of various versions of inflationary theory is the investigation of anisotropy of the 
cosmic microwave background (CMB) radiation coming to us from all directions. 
By studying this radiation, one can use the whole sky as a giant photographic plate 
with the amplified image of inflationary quantum fluctuations imprinted on it. The 
results of this investigation, in combination with the study of supernova and of the 
large-scale structure of the universe, have already confirmed many of the predictions 
of the new cosmological theory. 

From this quick sketch of the evolution of our picture of the universe during the 
last 30 years one can easily see how challenging it may be to write a book serving 
as a guide in this vast and rapidly growing area of physics. That is why it gives 
me a special pleasure to introduce the book Physical Foundations of Cosmology by 
Viatcheslav Mukhanov. 

In the first part of the book the author considers a homogeneous universe. One 
can find there not only the description of the basic cosmological models, but also 
an excellent introduction to the theory of physical processes in the early universe, 
such as the theory of nucleosynthesis, the theory of cosmological phase transitions, 
baryogenesis and inflationary cosmology. All of the necessary concepts from the 
general theory of relativity and particle physics are introduced and explained in 
an accurate and intuitively clear way. This part alone could be considered a good 
textbook in modern cosmology; it may serve as a basis for a separate course of 
lectures on this subject. 

But if you are preparing for active research in modern cosmology, you may 
particularly appreciate the second part of the book, where the author discusses the 
formation and evolution of the large-scale structure of our universe. In order to 
understand this process, one must learn the theory of production of metric pertur- 
bations during inflation. 

In 1981 Mukhanov and Chibisov discovered, in the context of the Starobinsky 
model, that the accelerated expansion can amplify the initial quantum perturbations 
of metric up to the values sufficient for explaining the large-scale structure of the
universe. In 1982, a combined effort of many participants of the Nuffield Sym- 
posium in Cambridge allowed them to come to a similar conclusion with respect 
to the new inflationary universe scenario. A few years later, Mukhanov developed 
the general theory of inflationary perturbations of metric, valid for a broad class of 
inflationary models, including chaotic inflation. Since that time, his approach has 
become the standard method of investigation of inflationary perturbations. 

A detailed description of this method is one of the most important features 
of this book. The theory of inflationary perturbations is quite complicated not 
only because it requires working knowledge of General Relativity and quantum 
field theory, but also because one should learn how to represent the results of the 
calculations in terms of variables that do not depend on the arbitrary choice of 
coordinates. It is very important to have a real master guiding you through this 
difficult subject, and Mukhanov does it brilliantly. He begins with a reminder of the 
simple Newtonian approach to the theory of density perturbations in an expanding 
universe, then extends this investigation to the general theory of relativity, and 
finishes with the full quantum theory of production and subsequent evolution of 
inflationary perturbations of metric. 

The last chapter of the book provides the necessary link between this theory and 
the observations of the CMB anisotropy. Everyone who has studied this subject 
knows the famous figures of the spectrum of the CMB anisotropy, with several 
different peaks predicted by inflationary cosmology. The shape of the spectrum 
depends on various cosmological parameters, such as the total density of matter in 
the universe, the Hubble constant, etc. By measuring the spectrum one can determine 
these parameters experimentally. The standard approach is based on the numerical 
analysis using the CMBFAST code. Mukhanov made one further step and derived 
an analytic expression for the CMB spectrum, which can help the readers to obtain 
a much better understanding of the origin of the peaks, of their position and their 
height as a function of the cosmological parameters. 

As in a good painting, this book consists of many layers. It can serve as an 
introduction to cosmology for the new generation of researchers, but it also contains 
a lot of information which can be very useful even for the best experts in this subject. 

We live at a very unusual time. According to the observational data, the universe 
is approximately 14 billion years old. A hundred years ago we did not even know 
that it is expanding. A few decades from now we will have a detailed map of the 
observable part of the universe, and this map is not going to change much for the 
next billion years. We live at the time of the great cosmological discoveries, and I 
hope that this book will help us in our quest. 


Preface 


This textbook is designed both for serious students of physics and astrophysics 
and for those with a particular interest in learning about theoretical cosmology. 
There are already many books that survey current observations and describe theo- 
retical results; my goal is to complement the existing literature and to show where 
the theoretical results come from. Cosmology uses methods from nearly all fields 
of theoretical physics, among which are General Relativity, thermodynamics and 
statistical physics, nuclear physics, atomic physics, kinetic theory, particle physics 
and field theory. I wanted to make the book useful for undergraduate students and, 
therefore, decided not to assume preliminary knowledge in any specialized field. 
With very few exceptions, the derivation of every formula in the book begins with 
basic physical principles from undergraduate courses. Every chapter starts with a 
general elementary introduction. For example, I have tried to make such a geometri- 
cal topic as conformal diagrams understandable even to those who have only a vague 
idea about General Relativity. The derivations of the renormalization group equa- 
tion, the effective potential, the non-conservation of fermion number, and quantum 
cosmological perturbations should also, in principle, require no prior knowledge of 
quantum field theory. All elements of the Standard Model of particle physics needed 
in cosmological applications are derived from the initial idea of gauge invariance 
of the electromagnetic field. Of course, some knowledge of general relativity and 
particle physics would be helpful, but this is not a necessary condition for under- 
standing the book. It is my hope that a student who has not previously taken the 
corresponding courses will be able to follow all the derivations. 

This book is meant to be neither encyclopedic nor a sourcebook for the most 
recent observational data. In fact, I avoid altogether the presentation of data; after 
all the data change very quickly and are easily accessible from numerous available 
monographs as well as on the Internet. Furthermore, I have intentionally restricted 
the discussion in this book to results that have a solid basis. I believe it is premature 
to present detailed mathematical consideration of controversial topics in a book on 
the foundations of cosmology and, therefore, such topics are covered only at a very
elementary level. 

Inflationary theory and the generation of primordial cosmological perturbations, 
which I count among the solid results, are discussed in great detail. Here, I have 
tried to delineate carefully the robust features of inflation which do not depend 
on the particular inflationary scenario. Among the other novel features of the book 
is the analytical treatment of some topics which are traditionally considered as 
highly numerical, for example, primordial nucleosynthesis, recombination and the 
cosmic microwave background anisotropy. 

Some words must be said about my decision to imbed problems in the main 
text rather than gathering them at the end. I have tried to make the derivations 
as transparent as possible so that the reader should be able to proceed from one 
equation to the next without making calculations on the way. In cases where this 
strategy failed, I have included problems, which thus constitute an integral part of 
the main text. Therefore, even the casual reader who is not solving the problems is 
encouraged to read them. 
Units and conventions


Planckian (natural) units Gravity, quantum theory and thermodynamics play an 
important role in cosmology. It is not surprising, therefore, that all fundamental 
physical constants, such as the gravitational constant $G$, Planck’s constant $\hbar$, the
speed of light $c$ and Boltzmann’s constant $k_B$, enter the main formulae describing
the universe. These formulae look much nicer if one uses (Planckian) natural units
by setting $G = \hbar = c = k_B = 1$. In this case, all constants drop from the formulae
and, after the calculations are completed, they can easily be restored in the final 
result if needed. For this reason, nearly all the calculations in this book are made 
using natural units, though the gravitational constant and Planck’s constant are 
kept in some formulae in order to stress the relevance of gravitational and quantum 
physics for describing the corresponding phenomena. 

After the formula for some physical quantity is derived in Planckian units, one 
can immediately calculate its numerical value in usual units simply by using the 
values of the elementary Planckian units: 





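\[ l_{Pl} = \left(\frac{G\hbar}{c^3}\right)^{1/2} = 1.616 \times 10^{-33}\ \text{cm}, \]

\[ t_{Pl} = \frac{l_{Pl}}{c} = 5.391 \times 10^{-44}\ \text{s}, \]

\[ m_{Pl} = \left(\frac{\hbar c}{G}\right)^{1/2} = 2.177 \times 10^{-5}\ \text{g}, \]

\[ T_{Pl} = \frac{m_{Pl} c^2}{k_B} = 1.416 \times 10^{32}\ \text{K} = 1.221 \times 10^{19}\ \text{GeV}. \]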





Planckian units with other dimensions can easily be built out of these quantities.
For example, the Planckian density and the Planckian area are
$\varepsilon_{Pl} = m_{Pl}/l_{Pl}^3 = 5.157 \times 10^{93}$ g cm$^{-3}$ and $S_{Pl} = l_{Pl}^2 = 2.611 \times 10^{-66}$ cm$^2$, respectively.
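The Planckian values quoted above can be checked directly from the fundamental constants. The following is only a minimal sketch: the CGS values of the constants are rounded and are an assumption of this example, and the function name `planck_units` is introduced here purely for illustration.

```python
import math

# Fundamental constants in CGS units (rounded values, assumed for this sketch).
G = 6.6743e-8          # gravitational constant, cm^3 g^-1 s^-2
HBAR = 1.054572e-27    # reduced Planck constant, erg s
C = 2.99792458e10      # speed of light, cm s^-1
K_B = 1.380649e-16     # Boltzmann constant, erg K^-1


def planck_units():
    """Return the elementary Planckian units and two derived ones (CGS)."""
    l_pl = math.sqrt(G * HBAR / C**3)   # Planck length, cm
    t_pl = l_pl / C                     # Planck time, s
    m_pl = math.sqrt(HBAR * C / G)      # Planck mass, g
    temp_pl = m_pl * C**2 / K_B         # Planck temperature, K
    return {
        "length_cm": l_pl,
        "time_s": t_pl,
        "mass_g": m_pl,
        "temperature_K": temp_pl,
        "density_g_cm3": m_pl / l_pl**3,  # Planckian density
        "area_cm2": l_pl**2,              # Planckian area
    }


if __name__ == "__main__":
    for name, value in planck_units().items():
        print(f"{name:>16s} = {value:.3e}")
```

With these inputs the computed values should agree with the numbers quoted in the text to within the rounding of the constants.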

Two examples below show how to make calculations using Planckian units. 
Example 1 Calculate the number density of photons in the background radiation
today. In usual units, the temperature of the background radiation is $T \simeq 2.73$ K.
In dimensionless Planckian units, this temperature is equal to

\[ T \simeq \frac{2.73\ \text{K}}{1.416 \times 10^{32}\ \text{K}} \simeq 1.93 \times 10^{-32}. \]

The number density of photons in natural units is

\[ n_\gamma = \frac{3\zeta(3)}{2\pi^2}\, T^3 \simeq \frac{3 \times 1.202}{2\pi^2} \left(1.93 \times 10^{-32}\right)^3 \simeq 1.31 \times 10^{-96}. \]

To determine the number density of photons per cubic centimeter, we must multiply
the dimensionless density obtained by the Planckian quantity with the corresponding
dimension cm$^{-3}$, namely $l_{Pl}^{-3}$:

\[ n_\gamma \simeq 1.31 \times 10^{-96} \times \left(1.616 \times 10^{-33}\ \text{cm}\right)^{-3} \simeq 310\ \text{cm}^{-3}. \]


Example 2 Determine the energy density of the universe 1 s after the big bang
and estimate the temperature at this time. The early universe is dominated by
ultra-relativistic matter, and in natural units the energy density $\varepsilon$ is related to the time $t$
via

\[ \varepsilon = \frac{3}{32\pi t^2}. \]

The time 1 s expressed in dimensionless units is

\[ t \simeq \frac{1\ \text{s}}{5.391 \times 10^{-44}\ \text{s}} \simeq 1.86 \times 10^{43}; \]

hence the energy density at this time is equal to

\[ \varepsilon = \frac{3}{32\pi \left(1.86 \times 10^{43}\right)^2} \simeq 8.63 \times 10^{-89} \]

Planckian units. To express the energy density in usual units, we have to multiply
this number by the Planckian density, $\varepsilon_{Pl} = 5.157 \times 10^{93}$ g cm$^{-3}$. Thus we obtain

\[ \varepsilon \simeq \left(8.63 \times 10^{-89}\right) \varepsilon_{Pl} \simeq 4.45 \times 10^{5}\ \text{g cm}^{-3}. \]

To make a rough estimate of the temperature, we note that in natural units $\varepsilon \sim T^4$,
hence $T \sim \varepsilon^{1/4} \sim \left(10^{-88}\right)^{1/4} = 10^{-22}$ Planckian units. In usual units,

\[ T \sim 10^{-22}\, T_{Pl} \simeq 10^{10}\ \text{K} \simeq 1\ \text{MeV}. \]

From this follows the useful relation between the temperature in the early Universe,
measured in MeV, and the time, measured in seconds: $T_{\text{MeV}} = O(1)\, t_{\text{sec}}^{-1/2}$.
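The arithmetic of Example 2 can be reproduced in a few lines. This is a sketch under stated assumptions: the Planckian conversion factors are the rounded values quoted in this section, the function names are hypothetical, and the temperature is only the order-of-magnitude estimate $T \sim \varepsilon^{1/4}$ used in the text, with all numerical factors of order unity dropped.

```python
import math

# Planckian conversion factors, as quoted in this section (rounded).
PLANCK_TIME_S = 5.391e-44          # s
PLANCK_DENSITY_G_CM3 = 5.157e93    # g cm^-3
PLANCK_TEMPERATURE_K = 1.416e32    # K


def energy_density_at(t_seconds):
    """Energy density of a radiation-dominated universe at time t.

    Returns (value in Planckian units, value in g cm^-3),
    using eps = 3 / (32 pi t^2) with t in Planckian units.
    """
    t = t_seconds / PLANCK_TIME_S
    eps_planck = 3.0 / (32.0 * math.pi * t**2)
    return eps_planck, eps_planck * PLANCK_DENSITY_G_CM3


def rough_temperature_kelvin(eps_planck):
    """Order-of-magnitude estimate T ~ eps^(1/4), converted to kelvin."""
    return eps_planck**0.25 * PLANCK_TEMPERATURE_K


if __name__ == "__main__":
    eps_planck, eps_cgs = energy_density_at(1.0)
    print(f"eps = {eps_planck:.2e} Planckian units = {eps_cgs:.2e} g/cm^3")
    print(f"T   ~ {rough_temperature_kelvin(eps_planck):.1e} K")
```

Because the time is not rounded to $1.86 \times 10^{43}$ before squaring, the computed density differs from the value in the text at the sub-percent level.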
Astronomical units In astronomy, distances are usually measured in parsecs and
megaparsecs instead of centimeters. They are related to centimeters via

\[ 1\ \text{pc} = 3.26\ \text{light years} = 3.086 \times 10^{18}\ \text{cm}, \qquad 1\ \text{Mpc} = 10^{6}\ \text{pc}. \]

The masses of galaxies and clusters of galaxies are expressed in terms of the mass
of the Sun,

\[ M_\odot \simeq 1.989 \times 10^{33}\ \text{g}. \]


Charge units We use the Heaviside—Lorentz system for normalization of the ele- 
mentary electric charge e. This system is adopted in most books on particle physics 
and in these units the Coulomb force between two electrons separated by a distance
$r$ is

\[ F = \frac{e^2}{4\pi r^2}. \]

The dimensionless fine structure constant is $\alpha \equiv e^2/4\pi \simeq 1/137$.





Signature Throughout the book, we will always use the signature $(+, -, -, -)$ for
the metric, so that the Minkowski metric takes the form $ds^2 = dt^2 - dx^2 - dy^2 - dz^2$.


Part I 


Homogeneous isotropic universe 


1 


Kinematics and dynamics of an expanding universe 


The most important feature of our universe is its large scale homogeneity and 
isotropy. This feature ensures that observations made from our single vantage point 
are representative of the universe as a whole and can therefore be legitimately used 
to test cosmological models. 

For most of the twentieth century, the homogeneity and isotropy of the universe 
had to be taken as an assumption, known as the “Cosmological Principle.” Physicists 
often use the word “principle” to designate what are at the time wild, intuitive 
guesses in contrast to “laws,” which refer to experimentally established facts. 

The Cosmological Principle remained an intelligent guess until firm empirical 
data, confirming large scale homogeneity and isotropy, were finally obtained at the 
end of the twentieth century. The nature of the homogeneity is certainly curious. 
The observable patch of the universe is of order 3000 Mpc (1 Mpc $\simeq 3.26 \times 10^{6}$
light years $\simeq 3.08 \times 10^{24}$ cm). Redshift surveys suggest that the universe is
homogeneous and isotropic only when coarse grained on 100 Mpc scales; on smaller 
scales there exist large inhomogeneities, such as galaxies, clusters and superclusters. 
Hence, the Cosmological Principle is only valid within a limited range of scales, 
spanning a few orders of magnitude. 

Moreover, theory suggests that this may not be the end of the story. According 
to inflationary theory, the universe continues to be homogeneous and isotropic 
over distances larger than 3000 Mpc, but it becomes highly inhomogeneous when 
viewed on scales much much larger than the observable patch. This dampens, to 
some degree, our hope of comprehending the entire universe. We would like to 
answer such questions as: What portion of the entire universe is like the part we 
find ourselves in? What fraction has a predominance of matter over antimatter? 
Or is spatially flat? Or is accelerating or decelerating? These questions are not 
only difficult to answer, but they are also hard to pose in a mathematically precise 
way. And, even if a suitable mathematical definition can be found, it is difficult to 
imagine how we could verify empirically any theoretical predictions concerning 
scales greatly exceeding the observable universe. The subject is too seductive to
avoid speculations altogether, but we will, nevertheless, try to focus on the salient, 
empirically testable features of the observable universe. 

It is firmly established by observations that our universe: 


• is homogeneous and isotropic on scales larger than 100 Mpc and has well developed
inhomogeneous structure on smaller scales;
• expands according to the Hubble law.


Concerning the matter composition of the universe, we know that: 


• it is pervaded by thermal microwave background radiation with temperature $T \simeq 2.73$ K;

• there is baryonic matter, roughly one baryon per $10^{9}$ photons, but no substantial amount
of antimatter;

• the chemical composition of baryonic matter is about 75% hydrogen, 25% helium, plus
trace amounts of heavier elements;

• baryons contribute only a small percentage of the total energy density; the rest is a dark
component, which appears to be composed of cold dark matter with negligible pressure
(~25%) and dark energy with negative pressure (~70%).


Observations of the fluctuations in the cosmic microwave background radiation 
suggest that: 


• there were only small fluctuations of order $10^{-5}$ in the energy density distribution when
the universe was a thousand times smaller than now.


For a review of the observational evidence the reader is encouraged to refer 
to recent papers and reviews. In this book we concentrate mostly on theoretical 
understanding of these basic observational facts. 

Any cosmological model worthy of consideration must be consistent with estab- 
lished facts. While the standard big bang model accommodates most known facts, 
a physical theory is also judged by its predictive power. At present, inflationary the- 
ory, naturally incorporating the success of the standard big bang, has no competitor 
in this regard. Therefore, we will build upon the standard big bang model, which 
will be our starting point, until we reach contemporary ideas of inflation. 


1.1 Hubble law 


In a nutshell, the standard big bang model proposes that the universe emerged about 
15 billion years ago with a homogeneous and isotropic distribution of matter at very 
high temperature and density, and has been expanding and cooling since then. We 
begin our account with the Newtonian theory of gravity, which captures many of 
the essential aspects of the universe’s dynamics and gives us an intuitive grasp of 
what happens. After we have reached the limits of validity of Newtonian theory,
we turn to a proper relativistic treatment. 

In an expanding, homogeneous and isotropic universe, the relative velocities of 
observers obey the Hubble law: the velocity of observer B with respect to A is 


\[ \mathbf{v}_{B(A)} = H(t)\,\mathbf{r}_{BA}, \qquad (1.1) \]

where the Hubble parameter $H(t)$ depends only on the time $t$, and $\mathbf{r}_{BA}$ is the vector
pointing from A to B. Some refer to H as the Hubble “constant” to stress its 
independence of the spatial coordinates, but it is important to recognize that H is, 
in general, time-varying. 

In a homogeneous, isotropic universe there are no privileged vantage points 
and the expansion appears the same to all observers wherever they are located. The 
Hubble law is in complete agreement with this. Let us consider how two observers A 
and B view a third observer C (Figure 1.1). The Hubble law specifies the velocities 
of the other two observers relative to A: 


\[ \mathbf{v}_{B(A)} = H(t)\,\mathbf{r}_{BA}, \qquad \mathbf{v}_{C(A)} = H(t)\,\mathbf{r}_{CA}. \qquad (1.2) \]


From these relations, we can find the relative velocity of observer C with respect 
to observer B: 


\[ \mathbf{v}_{C(B)} = \mathbf{v}_{C(A)} - \mathbf{v}_{B(A)} = H(t)\left(\mathbf{r}_{CA} - \mathbf{r}_{BA}\right) = H\,\mathbf{r}_{CB}. \qquad (1.3) \]


The result is that observer B sees precisely the same expansion law as observer A. 
In fact, the Hubble law is the unique expansion law compatible with homogeneity 
and isotropy. 
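The change of observer in (1.2) and (1.3) can be illustrated numerically. The sketch below is not part of the original argument: the positions, the value of $H$ and the helper names are arbitrary choices made only for this example.

```python
import math


def hubble_velocity(H, r):
    """Velocity assigned by the Hubble law to a separation vector r."""
    return tuple(H * x for x in r)


def subtract(u, v):
    """Component-wise difference u - v of two vectors."""
    return tuple(a - b for a, b in zip(u, v))


def velocity_of_C_seen_by_B(H, r_BA, r_CA):
    """Relative velocity of C with respect to B, built from A's measurements."""
    v_BA = hubble_velocity(H, r_BA)
    v_CA = hubble_velocity(H, r_CA)
    return subtract(v_CA, v_BA)


if __name__ == "__main__":
    H = 70.0                      # arbitrary illustrative value
    r_BA = (10.0, 0.0, 0.0)       # position of B relative to A
    r_CA = (3.0, 4.0, 12.0)       # position of C relative to A

    from_A = velocity_of_C_seen_by_B(H, r_BA, r_CA)
    direct = hubble_velocity(H, subtract(r_CA, r_BA))   # H * r_CB
    same = all(math.isclose(a, b, abs_tol=1e-9) for a, b in zip(from_A, direct))
    print("B sees the same Hubble law as A:", same)
```

The check succeeds for any choice of positions because the law is linear in the separation vector, which is the property that Problem 1.1 asks the reader to establish in general.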








Fig. 1.1. [Diagram residue; labels: $A$, $B$, $\mathbf{r}_{BA}$, $\mathbf{v}_{C(B)}$.]

Fig. 1.2.


Problem 1.1 In order for a general expansion law, v = f (r,t), to be the same for 
all observers, the function f must satisfy the relation 


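\[ \mathbf{f}\left(\mathbf{r}_{CA} - \mathbf{r}_{BA}, t\right) = \mathbf{f}\left(\mathbf{r}_{CA}, t\right) - \mathbf{f}\left(\mathbf{r}_{BA}, t\right). \qquad (1.4) \]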
Show that the only solution of this equation is given by (1.1). 


A useful analogy for envisioning Hubble expansion is the two-dimensional sur- 
face of an expanding sphere (Figure 1.2). The angle $\theta_{AB}$ between any two points A
and B on the surface of the sphere remains unchanged as its radius a(t) increases. 
Therefore the distance between the points, measured along the surface, grows as 


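\[ r_{AB}(t) = a(t)\,\theta_{AB}, \qquad (1.5) \]

implying a relative velocity

\[ v_{AB} = \dot{r}_{AB} = \dot{a}(t)\,\theta_{AB} = \frac{\dot{a}}{a}\, r_{AB}, \qquad (1.6) \]

where dot denotes a derivative with respect to time $t$. Thus, the Hubble law emerges
here with $H(t) \equiv \dot{a}/a$.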

The distance between any two observers A and B in a homogeneous and isotropic
universe can be also rewritten in a form similar to (1.5). Integrating the equation 


\[ \dot{\mathbf{r}}_{BA} = H(t)\,\mathbf{r}_{BA}, \qquad (1.7) \]

we obtain

\[ \mathbf{r}_{BA}(t) = a(t)\,\boldsymbol{\chi}_{BA}, \qquad (1.8) \]

where

\[ a(t) = \exp\left(\int H(t)\,dt\right) \qquad (1.9) \]

is called the scale factor and is the analogue of the radius of the 2-sphere. The
integration constant, $\boldsymbol{\chi}_{BA}$, is the analogue of $\theta_{AB}$ and can be interpreted as the
distance between points A and B at some particular moment of time. It is called the
Lagrangian or comoving coordinate of B, assuming a coordinate system centered 
at A. 

In the 2-sphere analogy, a(t) has a precise geometrical interpretation as the radius 
of the sphere and, consequently, has a fixed normalization. In Newtonian theory, 
however, the value of the scale factor a(t) itself has no geometrical meaning and 
its normalization can be chosen arbitrarily. Once the normalization is fixed, the 
scale factor a(t) describes the distance between observers as a function of time. For 
example, when the scale factor increases by a factor of 3, the distance between any 
two observers increases threefold. Therefore, when we say the size of the universe 
was, for instance, 1000 times smaller, this means that the distance between any two 
comoving objects was 1000 times smaller — a statement which makes sense even 
in an infinitely large universe. The Hubble parameter, which is equal to 


\[ H(t) \equiv \frac{\dot{a}}{a}, \qquad (1.10) \]


measures the expansion rate. 
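The relation between the Hubble parameter and the scale factor, $a(t) = \exp\left(\int H\,dt\right)$, can be made concrete with a small numerical integration. The sketch below is illustrative only: the form $H(t) = 2/(3t)$ is an assumed test case that has not been derived at this point in the text, chosen because the integral is known in closed form, $a(t_2)/a(t_1) = (t_2/t_1)^{2/3}$. The function names are hypothetical.

```python
import math


def hubble_test_case(t):
    """Assumed illustrative Hubble parameter, H(t) = 2 / (3 t)."""
    return 2.0 / (3.0 * t)


def scale_factor_ratio(hubble, t_start, t_end, steps=10_000):
    """Return a(t_end) / a(t_start) = exp(integral of H dt).

    The integral is evaluated with the composite trapezoidal rule.
    """
    h = (t_end - t_start) / steps
    total = 0.5 * (hubble(t_start) + hubble(t_end))
    for i in range(1, steps):
        total += hubble(t_start + i * h)
    return math.exp(total * h)


if __name__ == "__main__":
    ratio = scale_factor_ratio(hubble_test_case, 1.0, 8.0)
    print(f"numerical a(8)/a(1) = {ratio:.6f}")
    print(f"analytic  (8/1)^(2/3) = {8.0 ** (2.0 / 3.0):.6f}")
```

Only the ratio of scale factors is computed, which reflects the remark above that the normalization of $a(t)$ is arbitrary in Newtonian theory.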

In this description, we are assuming a perfectly homogeneous and isotropic 
universe in which all observers are comoving in the sense that their coordinates x 
remain unchanged. In the real universe, wherever matter is concentrated, the motion 
of nearby objects is dominated by the inhomogeneities in the gravitational field, 
which lead, for example, to virial orbital motion rather than Hubble expansion. 
Similarly, objects held together by other, stronger forces resist Hubble expansion. 
The velocity of these objects relative to comoving observers is referred to as the “pe- 
culiar” velocity. Hence, the Hubble law is valid only on the scales of homogeneity. 


Problem 1.2 Typical peculiar velocities of galaxies are about a few hundred kilo- 
meters per second. The mean distance between large galaxies is about 1 Mpc. How 
distant must a galaxy be from us for its peculiar velocity to be small compared to 
its comoving (Hubble) velocity, if the Hubble parameter is 75 km s$^{-1}$ Mpc$^{-1}$?


The current value of the Hubble parameter, $H_0$, can be determined by measuring
the ratio of the recession velocity to the distance for an object whose peculiar 
velocity is small compared to its comoving velocity. The recessional velocity can 
be accurately measured because it induces a Doppler shift in spectral lines. The 
challenge is to find a reliable measure of the distance. Two methods used are based 
on the concepts of “standard candles” and “standard rulers.” A class of objects is 
called a standard candle if the objects have about the same luminosity. Usually, they 
possess a set of characteristics that can be used to identify them even when they
are far away. For example, Cepheid variable stars pulse at a periodic rate, and Type 
IA supernovae are bright, exploding stars with a characteristic spectral pattern. The 
distances to nearby objects in the class are measured directly (for example, by 
parallax) or by comparing them to another standard candle whose distance has 
already been calibrated. Once the distance to a subset of a given standard candle class 
has been measured, the distance to further members of that class can be determined: 
the inverse square law relates the apparent luminosity of the distant objects to that of 
the nearby objects whose distance is already determined. The standard ruler method 
is exactly like the standard candle method except that it relies on identifying a class 
of objects of the same size rather than the same luminosity. It is clear, however, that 
only if the variation in luminosity or size of objects within the same class is small 
can they be useful for measuring the Hubble parameter. Cepheid variable stars have 
been studied for nearly a century and appear to be good standard candles. Type IA 
supernovae are promising candidates which are potentially important because they 
can be observed at much greater distances than Cepheids. Because of systematic 
uncertainties, the value of the measured Hubble constant is known today with only 
modest accuracy and is about 65-80 km s$^{-1}$ Mpc$^{-1}$.
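The inverse square law step of the standard candle method can be sketched as follows. This is a minimal illustration, assuming that both objects have exactly the same intrinsic luminosity and that the geometry is Euclidean, which is adequate only for relatively nearby objects. The function names and the sample numbers are hypothetical.

```python
import math


def standard_candle_distance(d_near, flux_near, flux_far):
    """Distance to a far member of a standard candle class.

    Assumes equal luminosity, so flux * distance^2 is the same for both:
    d_far = d_near * sqrt(flux_near / flux_far).
    """
    return d_near * math.sqrt(flux_near / flux_far)


def hubble_parameter_estimate(velocity_km_s, distance_mpc):
    """Ratio of recession velocity to distance, in km/s/Mpc."""
    return velocity_km_s / distance_mpc


if __name__ == "__main__":
    # A calibrated candle at 10 Mpc and a second one appearing 100 times fainter.
    d_far = standard_candle_distance(10.0, 100.0, 1.0)
    print(f"distance to the faint object: {d_far:.1f} Mpc")
    # A made-up recession velocity, used only to show the final division.
    print(f"H0 estimate: {hubble_parameter_estimate(7000.0, d_far):.1f} km/s/Mpc")
```

The same structure applies to standard rulers, with the apparent angular size replacing the flux and a linear rather than quadratic dependence on distance.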

Knowing the value of the Hubble constant, we can obtain a rough estimate 
for the age of the universe. If we neglect gravity and consider the velocity to be 
constant in time, then two points separated by $|\mathbf{r}|$ today, coincided in the past,
$t_0 \simeq |\mathbf{r}|/|\mathbf{v}| = 1/H_0$ ago. For the measured value of the Hubble constant, $t_0$ is
about 15 billion years. We will show later that the exact value for the age of the
universe differs from this rough estimate by a factor of order unity, depending on 
the composition and curvature of the universe. 
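The estimate $t_0 \simeq 1/H_0$ amounts to a unit conversion, sketched below. The conversion factors are rounded, the length of the year is an assumed value, and the function name is introduced only for this example.

```python
KM_PER_MPC = 3.086e19        # 1 Mpc = 3.086e24 cm = 3.086e19 km
SECONDS_PER_YEAR = 3.156e7   # assumed rounded value


def hubble_time_gyr(h0_km_s_mpc):
    """Return 1/H0 in billions of years for H0 given in km/s/Mpc."""
    h0_per_second = h0_km_s_mpc / KM_PER_MPC
    return 1.0 / h0_per_second / SECONDS_PER_YEAR / 1e9


if __name__ == "__main__":
    for h0 in (65.0, 75.0, 80.0):
        print(f"H0 = {h0:4.0f} km/s/Mpc  ->  1/H0 = {hubble_time_gyr(h0):.1f} Gyr")
```

For the quoted range of 65-80 km s$^{-1}$ Mpc$^{-1}$ this gives roughly 12 to 15 billion years, consistent with the figure in the text.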

Because the Hubble law has a kinematical origin and its form is dictated by the 
requirement of homogeneity and isotropy, it has to be valid in both Newtonian theory 
and General Relativity. In fact, rewritten in the form (1.8), it can be immediately 
applied in Einstein’s theory. This remark may be disconcerting since, according to 
the Hubble law, the relative velocity can exceed the speed of light for two objects 
separated by a distance larger than 1/H. How can this be consistent with Special 
Relativity? The resolution of the paradox is that, in General Relativity, the relative 
velocity has no invariant meaning for objects whose separation exceeds 1/H, which 
represents the curvature scale. We will explore this point further in the context of the
Milne universe (Section 1.3.5), following the discussion of Newtonian cosmology. 


1.2 Dynamics of dust in Newtonian cosmology 


We first consider an infinite, expanding, homogeneous and isotropic universe filled 
with “dust,” a euphemism for matter whose pressure $p$ is negligible compared
to its energy density $\varepsilon$. (In cosmology the terms “dust” and “matter” are used
interchangeably to represent nonrelativistic particles.) Let us choose some arbitrary
point as the origin and consider an expanding sphere about that origin with radius
$R(t) = a(t)\chi_{com}$. Provided that gravity is weak and the radius is small enough that
the speed of the particles within the sphere relative to the origin is much less than 
the speed of light, the expansion can be described by Newtonian gravity. (Actually, 
General Relativity is involved here in an indirect way. We assume the net effect on 
a particle within the sphere due to the matter outside the sphere is zero, a premise 
that is ultimately justified by Birkhoff’s theorem in General Relativity.) 


1.2.1 Continuity equation 


The total mass M within the sphere is conserved. Therefore, the energy density due 
to the mass of the particles is 


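$$\varepsilon(t) = \frac{M}{(4\pi/3)R^3(t)} = \varepsilon_0\left(\frac{a_0}{a(t)}\right)^3, \tag{1.11}$$


where $\varepsilon_0$ is the energy density at the moment when the scale factor is equal to $a_0$. It
is convenient to rewrite this conservation law in differential form. Taking the time
derivative of (1.11), we obtain


$$\dot\varepsilon(t) = -3\varepsilon_0\left(\frac{a_0}{a(t)}\right)^3\frac{\dot a}{a} = -3H\varepsilon(t). \tag{1.12}$$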


This equation is a particular case of the nonrelativistic continuity equation, 


$$\frac{\partial\varepsilon}{\partial t} = -\nabla(\varepsilon\mathbf{v}), \tag{1.13}$$

if we take $\varepsilon(\mathbf{x}, t) = \varepsilon(t)$ and $\mathbf{v} = H(t)\mathbf{r}$. Beginning with the continuity equation
and assuming homogeneous initial conditions, it is straightforward to show that the
unique velocity distribution which maintains homogeneity evolving in time is the
Hubble law: $\mathbf{v} = H(t)\mathbf{r}$.
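
The differential form of the conservation law can be checked numerically. The sketch below takes an arbitrary smooth expansion law (a hypothetical choice made only for illustration), sets the energy density proportional to $a^{-3}$, and evaluates the residual of $\dot\varepsilon + 3H\varepsilon$ with central finite differences; all function names are assumptions of this example.

```python
def scale_factor(t):
    # Hypothetical smooth expansion law, chosen only for illustration.
    return t ** 0.5 + 0.1 * t


def energy_density(t, eps0=1.0, a0=1.0):
    # Dust: the density dilutes as the inverse cube of the scale factor.
    return eps0 * (a0 / scale_factor(t)) ** 3


def derivative(f, t, h=1e-5):
    # Central finite difference approximation of df/dt.
    return (f(t + h) - f(t - h)) / (2.0 * h)


def continuity_residual(t):
    """Return d(eps)/dt + 3 H eps, which should vanish for dust."""
    hubble = derivative(scale_factor, t) / scale_factor(t)
    return derivative(energy_density, t) + 3.0 * hubble * energy_density(t)


for t in (0.5, 1.0, 2.0, 5.0):
    print(f"t = {t}: residual = {continuity_residual(t):.2e}")
```

The residual is zero up to finite-difference error for any choice of `scale_factor`, since the result does not depend on the particular expansion law.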


1.2.2 Acceleration equation 


Matter is gravitationally self-attractive and this causes the expansion of the universe 
to decelerate. To derive the equation of motion for the scale factor, consider a probe 
particle of mass m on the surface of the sphere, a distance R(t) from the origin. 
Assuming matter outside the sphere does not exert a gravitational force on the 
particle, the only force acting is due to the mass $M$ of all particles within the
sphere. The equation of motion, therefore, is


$$m\ddot R = -\frac{GmM}{R^2} = -\frac{4\pi}{3}Gm\frac{M}{(4\pi/3)R^3}R. \tag{1.14}$$

Using the expression for the energy density in (1.11) and substituting $R(t) = a(t)\chi_{com}$, we obtain

$$\ddot a = -\frac{4\pi}{3}G\varepsilon a. \tag{1.15}$$


The mass of the probe particle and the comoving size of the sphere $\chi_{com}$ drop out
of the final equation.

Equations (1.12) and (1.15) are the two master equations that determine the 
evolution of $a(t)$ and $\varepsilon(t)$. They exactly coincide with the corresponding equations
for dust (p = 0) in General Relativity. This is not as surprising as it may seem 
at first. The equations derived do not depend on the size of the auxiliary sphere 
and, therefore, are exactly the same for an infinitesimally small sphere where all 
the particles move with infinitesimal velocities and create a negligible gravitational 
field. In this limit, General Relativity exactly reduces to Newtonian theory and, 
hence, relativistic corrections should not arise. 


1.2.3 Newtonian solutions 


The closed form equation for the scale factor is obtained by substituting the expres- 
sion for the energy density (1.11) into the acceleration equation (1.15): 


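$$\ddot a = -\frac{4\pi G}{3}\frac{\varepsilon_0 a_0^3}{a^2}. \tag{1.16}$$


Multiplying this equation by $\dot a$ and integrating, we find


$$\frac{1}{2}\dot a^2 + V(a) = E, \tag{1.17}$$


where $E$ is a constant of integration and

$$V(a) = -\frac{4\pi G\varepsilon_0 a_0^3}{3a}.$$
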
Equation (1.17) is identical to the energy conservation equation for a rocket
launched from the surface of the Earth with unit mass and speed $\dot a$. The integration
constant $E$ represents the total energy of the rocket. Escape from the Earth occurs
if the positive kinetic energy overcomes the negative gravitational potential or,
equivalently, if $E$ is positive. If the kinetic energy is too small, the total energy $E$ is
negative and the rocket falls back to Earth. Similarly, the fate of the dust-dominated
universe, whether it expands forever or eventually recollapses, depends on the
sign of $E$. As pointed out above, the normalization of $a$ has no invariant meaning
in Newtonian gravity and it can be rescaled by an arbitrary factor. Hence, only the
sign of $E$ is physically relevant. Rewriting (1.17) as

$$H^2 - \frac{2E}{a^2} = \frac{8\pi G}{3}\varepsilon, \tag{1.18}$$

we see that the sign of $E$ is determined by the relation between the Hubble parameter,
which determines the kinetic energy of expansion, and the mass density, which 
defines the gravitational potential energy. 

In the rocket problem, the mass of the Earth is given and the student is asked to
compute the minimal escape velocity by setting $E = 0$ and solving for the velocity
$v$. In cosmology, the expansion velocity, as set by the Hubble parameter, has been
reasonably well measured while the mass density was very poorly determined for
most of the twentieth century. For this historical reason, the boundary between escape
and gravitational entrapment is traditionally characterized by a critical density,
rather than critical velocity. Setting $E = 0$ in (1.18), we obtain

$$\varepsilon^{cr} = \frac{3H^2}{8\pi G}. \tag{1.19}$$



The critical density decreases with time since $H$ is decreasing, though the term
“critical density” is often used to refer to its current value. Expressing $E$ in terms
of the energy density $\varepsilon(t)$ and the Hubble parameter $H(t)$, we find

$$E = \frac{4\pi G}{3}a^2\varepsilon^{cr}\left(1 - \frac{\varepsilon}{\varepsilon^{cr}}\right) = \frac{4\pi G}{3}a^2\varepsilon^{cr}\left[1 - \Omega(t)\right], \tag{1.20}$$

where

$$\Omega(t) \equiv \frac{\varepsilon(t)}{\varepsilon^{cr}(t)} \tag{1.21}$$

is called the cosmological parameter. Generally, $\Omega(t)$ varies with time, but because
the sign of $E$ is fixed, the difference $1 - \Omega(t)$ does not change sign. Therefore, by
measuring the current value of the cosmological parameter, $\Omega_0 \equiv \Omega(t_0)$, we can
determine the sign of $E$.
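
To give a feeling for the numbers, the sketch below evaluates the critical density for a given Hubble constant and the corresponding cosmological parameter. It works with mass density in SI units rather than energy density, and the function names, the rounded constants and the sample density are assumptions introduced for this example.

```python
import math

G = 6.674e-11           # Newton's constant in m^3 kg^-1 s^-2 (rounded)
M_PER_MPC = 3.0857e22   # metres in one megaparsec (rounded)


def critical_mass_density(h0_km_s_mpc):
    """Critical density 3 H^2 / (8 pi G), returned as a mass density in kg/m^3."""
    h0 = h0_km_s_mpc * 1.0e3 / M_PER_MPC   # convert km/s/Mpc to 1/s
    return 3.0 * h0 ** 2 / (8.0 * math.pi * G)


def omega(mass_density, h0_km_s_mpc):
    """Cosmological parameter: density in units of the critical density."""
    return mass_density / critical_mass_density(h0_km_s_mpc)


def sign_of_E(omega_value):
    """Sign of the Newtonian energy E, which is the sign of 1 - Omega."""
    if omega_value < 1.0:
        return 1
    if omega_value > 1.0:
        return -1
    return 0


rho_cr = critical_mass_density(70.0)
print(f"critical density for H0 = 70 km/s/Mpc: {rho_cr:.2e} kg/m^3")
# Hypothetical sample density, used only to exercise the functions.
print("sign of E:", sign_of_E(omega(3.0e-27, 70.0)))
```

For $H_0 = 70$ km s$^{-1}$ Mpc$^{-1}$ the critical density is of order $10^{-26}$ kg m$^{-3}$, a few hydrogen atoms per cubic metre.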

We shall see that the sign of E determines the spatial geometry of the universe 
in General Relativity. In particular, the spatial curvature has the opposite sign to E. 
Hence, in a dust-dominated universe, there is a direct link between the ratio of the 
energy density to the critical density, the spatial geometry and the future evolution 
of the universe. If $\Omega_0 = \varepsilon_0/\varepsilon_0^{cr} > 1$, then $E < 0$ and the spatial curvature is positive
(closed universe). In this case the scale factor reaches some maximal value and the
universe recollapses, as shown in Figure 1.3. When $\Omega_0 < 1$, $E$ is positive, the spatial
curvature is negative (open universe), and the universe expands hyperbolically. The
special case of $\Omega_0 = 1$, or $E = 0$, corresponds to parabolic expansion and flat spatial
geometry (flat universe). For both flat and open cases, the universe expands forever
at an ever-decreasing rate (Figure 1.3). In all three cases, extrapolating back to a
“beginning,” we face an “initial singularity,” where the scale factor approaches zero
and the expansion rate and energy density diverge.

Fig. 1.3. (Vertical axis: $a(t)$.)

The reader should be aware that the connection between $\Omega_0$ and the future
evolution of the universe discussed above is not universal, but depends on the 
matter content of the universe. We will see later that it is possible to have a closed 
universe that never recollapses. 


Problem 1.3 Show that $\dot a \to \infty$, $H \to \infty$ and $\varepsilon \to \infty$ when $a \to 0$.


Problem 1.4 Show that, for the expanding sphere of dust, $\Omega(t)$ is equal to the
absolute value of the ratio of the gravitational potential energy to the kinetic energy.
Since dust is gravitationally self-attractive, it decelerates the expansion rate.
Therefore, in the past, the kinetic energy was much larger than at present. To satisfy
the energy conservation law, the increase in kinetic energy should be accompanied
by an increase in the magnitude of the negative potential energy. Show that,
irrespective of its current value, $\Omega(t) \to 1$ as $a \to 0$.


Problem 1.5 Another convenient dimensionless parameter that characterizes the
expansion is the “deceleration parameter”:

$$q = -\frac{\ddot a}{aH^2}. \tag{1.22}$$

The sign of $q$ determines whether the expansion is slowing down or speeding up.
Find a general expression for $q$ in terms of $\Omega$ and verify that $q = 1/2$ in a flat
dust-dominated universe.


To conclude this section we derive an explicit solution for the scale factor in a
flat matter-dominated universe. Because $E = 0$, (1.17) can be rewritten as

$$a\dot a^2 = \frac{4}{9}\left(\frac{da^{3/2}}{dt}\right)^2 = \text{const}, \tag{1.23}$$

and, hence, its solution is

$$a \propto t^{2/3}. \tag{1.24}$$

For the Hubble parameter, we obtain

$$H = \frac{2}{3t}. \tag{1.25}$$

Thus, the current age of a flat ($E = 0$) dust-dominated universe is

$$t_0 = \frac{2}{3H_0}, \tag{1.26}$$

where $H_0$ is the present value of the Hubble parameter. We see that the result is not
very different from the rough estimate obtained by neglecting gravity. The energy
density of matter as a function of cosmic time can be found by substituting the
Hubble parameter (1.25) into (1.18):

$$\varepsilon(t) = \frac{1}{6\pi Gt^2}. \tag{1.27}$$

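
The power law (1.24) can also be recovered by direct numerical integration. The sketch below integrates $\dot a = \sqrt{\text{const}/a}$, which follows from (1.23), with a fourth-order Runge-Kutta scheme. The normalization const $= 4/9$ (for which $a = t^{2/3}$ exactly), the function names and the step count are assumptions of this example.

```python
CONST = 4.0 / 9.0   # value of a * adot^2 for which a = t^(2/3) exactly (assumed normalization)


def adot(a):
    # From a * adot^2 = const in a flat dust-dominated universe.
    return (CONST / a) ** 0.5


def hubble(a):
    # H = adot / a.
    return adot(a) / a


def integrate_flat_dust(t_start, a_start, t_end, steps=7000):
    """Fourth-order Runge-Kutta integration of da/dt = sqrt(const / a)."""
    h = (t_end - t_start) / steps
    a = a_start
    for _ in range(steps):
        k1 = adot(a)
        k2 = adot(a + 0.5 * h * k1)
        k3 = adot(a + 0.5 * h * k2)
        k4 = adot(a + h * k3)
        a += h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return a


a_end = integrate_flat_dust(1.0, 1.0, 8.0)
print("a(8) numerically:", a_end, " expected 8^(2/3) =", 8.0 ** (2.0 / 3.0))
print("H * t at t = 8:", hubble(a_end) * 8.0, " expected 2/3")
```

Starting from $a = 1$ at $t = 1$, the integration reproduces $a = t^{2/3}$ and $Ht = 2/3$ to numerical accuracy.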
Problem 1.6 Estimate the energy density at t = 10~** s, 1 s and 1 year after the 
big bang. 


Problem 1.7 Solve (1.18) in the limit $t \to \infty$ for an open universe and discuss the
properties of the solution. 


1.3 From Newtonian to relativistic cosmology 


General Relativity leads to a mathematically consistent theory of the universe, 
whereas Newtonian theory does not. For example, we pointed out that the 
Newtonian picture of an expanding, dust-filled universe relies on Birkhoff’s theo- 
rem, which is proven in General Relativity. In addition, General Relativity intro- 
duces key changes to the Newtonian description. First, Einstein’s theory proposes 
that geometry is dynamical and is determined by the matter composition of the 
universe. Second, General Relativity can describe matter moving with relativistic
velocities and having arbitrary pressure. We know that radiation, which has a pressure
equal to one third of its energy density, dominated the universe for the first
100 000 years after the big bang. Additionally, evidence suggests that most of the 
energy density today has negative pressure. To understand these important epochs 
in cosmic history, we are forced to go beyond Newtonian gravity and turn to a fully 
relativistic theory. We begin by considering what kind of three-dimensional spaces 
can be used to describe a homogeneous and isotropic universe. 


1.3.1 Geometry of a homogeneous, isotropic space 


The assumption that our universe is homogeneous and isotropic means that its evolu- 
tion can be represented as a time-ordered sequence of three-dimensional space-like 
hypersurfaces, each of which is homogeneous and isotropic. These hypersurfaces 
are the natural choice for surfaces of constant time. 

Homogeneity means that the physical conditions are the same at every point of 
any given hypersurface. Isotropy means that the physical conditions are identical 
in all directions when viewed from a given point on the hypersurface. Isotropy 
at every point automatically enforces homogeneity. However, homogeneity does 
not necessarily imply isotropy. One can imagine, for example, a homogeneous yet 
anisotropic universe which contracts in one direction and expands in the other two 
directions. 

Homogeneous and isotropic spaces have the largest possible symmetry group; 
in three dimensions there are three independent translations and three rotations. 
These symmetries strongly restrict the admissible geometry for such spaces. There 
exist only three types of homogeneous and isotropic spaces with simple topology: 
(a) flat space, (b) a three-dimensional sphere of constant positive curvature, and 
(c) a three-dimensional hyperbolic space of constant negative curvature. 

To help visualize these spaces, we consider the analogous two-dimensional ho- 
mogeneous, isotropic surfaces. The generalization to three dimensions is straight- 
forward. Two well known cases of homogeneous, isotropic surfaces are the plane 
and the 2-sphere. They both can be embedded in three-dimensional Euclidean space 
with the usual Cartesian coordinates x, y, z. The equation describing the embedding 
of a two-dimensional sphere (Figure 1.4) is 


$$x^2 + y^2 + z^2 = a^2, \tag{1.28}$$


where a is the radius of the sphere. Differentiating this equation, we see that, for 
two infinitesimally close points on the sphere, 





$$dz = -\frac{x\,dx + y\,dy}{z} = \pm\frac{x\,dx + y\,dy}{\sqrt{a^2 - x^2 - y^2}}.$$

Fig. 1.4.

Substituting this expression into the three-dimensional Euclidean metric,

$$dl^2 = dx^2 + dy^2 + dz^2, \tag{1.29}$$

gives

$$dl^2 = dx^2 + dy^2 + \frac{(x\,dx + y\,dy)^2}{a^2 - x^2 - y^2}. \tag{1.30}$$

In this way, the distance between a pair of points located on the 2-sphere is expressed 
entirely in terms of two independent coordinates $x$ and $y$, which are bounded,
$x^2 + y^2 \le a^2$. These coordinates, however, are degenerate in the sense that to every
given $(x, y)$ there correspond two different points on the sphere located in the
northern and southern hemispheres. It is convenient to introduce instead of $x$ and
$y$ the angular coordinates $r'$, $\varphi$ defined in the standard way:


$$x = r'\cos\varphi, \qquad y = r'\sin\varphi. \tag{1.31}$$


Differentiating the relation $x^2 + y^2 = r'^2$, we have

$$x\,dx + y\,dy = r'\,dr'.$$

Combining this with

$$dx^2 + dy^2 = dr'^2 + r'^2 d\varphi^2,$$

the metric in (1.30) becomes

$$dl^2 = \frac{dr'^2}{1 - (r'^2/a^2)} + r'^2 d\varphi^2. \tag{1.32}$$

The limit $a^2 \to \infty$ corresponds to a (flat) plane. We can also formally take $a^2$
to be negative and then metric (1.32) describes a homogeneous, isotropic two-dimensional
space with constant negative curvature, known as Lobachevski space.
Unlike the flat plane or the two-dimensional sphere, Lobachevski space cannot be
embedded in Euclidean three-dimensional space because the radius of the “sphere”
$a$ is imaginary (this is why this space is called a pseudo-sphere or hyperbolic
space). Of course, this does not mean that this space cannot exist. Any curved 
space can be described entirely in terms of its internal geometry without referring 
to its embedding. 


Problem 1.8 Lobachevski space can be visualized as a hyperboloid in Lorentzian
three-dimensional space (Figure 1.5). Verify that the embedding of the surface
$x^2 + y^2 - z^2 = -a^2$, where $a^2$ is positive, in the space with metric
$dl^2 = dx^2 + dy^2 - dz^2$ gives a Lobachevski space.


Introducing the rescaled coordinate $r = r'/\sqrt{|a^2|}$, we can recast metric (1.32)
as

$$dl^2 = |a^2|\left(\frac{dr^2}{1 - kr^2} + r^2 d\varphi^2\right), \tag{1.33}$$

where $k = +1$ for the sphere ($a^2 > 0$), $k = -1$ for the pseudo-sphere ($a^2 < 0$) and
$k = 0$ for the plane (two-dimensional flat space). In curved space, $|a^2|$ characterizes
the radius of curvature. In flat space, however, the normalization of $|a^2|$ does not
have any physical meaning and this factor can be absorbed by redefinition of the
coordinates. The generalization of the above consideration to three dimensions is
straightforward.


Problem 1.9 By embedding a three-dimensional sphere (pseudo-sphere) in a
four-dimensional Euclidean (Lorentzian) space, verify that the metric of a three-dimensional
space of constant curvature can be written as

$$dl_{3d}^2 = a^2\left(\frac{dr^2}{1 - kr^2} + r^2(d\theta^2 + \sin^2\theta\,d\varphi^2)\right), \tag{1.34}$$

where $a^2$ is positive and $k = 0, \pm 1$. Introduce the rescaled radial coordinate $\bar r$,
defined by

$$r = \frac{\bar r}{1 + k\bar r^2/4}, \tag{1.35}$$

and show that this metric can then be rewritten in explicitly isotropic form:

$$dl_{3d}^2 = a^2\frac{d\bar x^2 + d\bar y^2 + d\bar z^2}{(1 + k\bar r^2/4)^2}, \tag{1.36}$$

where

$$\bar x = \bar r\sin\theta\cos\varphi, \qquad \bar y = \bar r\sin\theta\sin\varphi, \qquad \bar z = \bar r\cos\theta.$$


In many applications, instead of the radial coordinate $r$, it is convenient to use
coordinate $\chi$ defined via the relation

$$d\chi^2 = \frac{dr^2}{1 - kr^2}. \tag{1.37}$$

It follows that

$$\chi = \begin{cases}\operatorname{arcsinh} r, & k = -1;\\ r, & k = 0;\\ \arcsin r, & k = +1.\end{cases} \tag{1.38}$$

The coordinate $\chi$ varies between 0 and $+\infty$ in flat and hyperbolic spaces, while
$\pi \ge \chi \ge 0$ in spaces with positive curvature ($k = +1$). In this last case, to every
particular $r$ correspond two different $\chi$. Thus, introducing $\chi$ removes the coordinate
degeneracy mentioned above. In terms of $\chi$, metric (1.34) takes the form

$$dl_{3d}^2 = a^2\left(d\chi^2 + \Phi^2(\chi)\,d\Omega^2\right) \equiv a^2\left(d\chi^2 + \begin{Bmatrix}\sinh^2\chi\\ \chi^2\\ \sin^2\chi\end{Bmatrix}d\Omega^2\right), \qquad \begin{matrix}k = -1;\\ k = 0;\\ k = +1,\end{matrix} \tag{1.39}$$

where

$$d\Omega^2 = d\theta^2 + \sin^2\theta\,d\varphi^2. \tag{1.40}$$
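
The change of radial coordinate can be made concrete with a few lines of Python. The sketch below implements the map from $r$ to $\chi$ and the function multiplying $d\Omega^2$ in the metric, and checks that the latter reproduces $r^2$, as comparison of the two forms of the metric requires. The function names are illustrative, and for $k = +1$ only the branch $0 \le \chi \le \pi/2$ is returned.

```python
import math


def chi_from_r(r, k):
    """Radial coordinate chi for a given r and curvature index k = -1, 0, +1."""
    if k == -1:
        return math.asinh(r)
    if k == 0:
        return r
    if k == 1:
        return math.asin(r)   # branch with 0 <= chi <= pi/2 only
    raise ValueError("k must be -1, 0 or +1")


def phi_squared(chi, k):
    """Coefficient of dOmega^2 in the metric written in terms of chi."""
    if k == -1:
        return math.sinh(chi) ** 2
    if k == 0:
        return chi ** 2
    if k == 1:
        return math.sin(chi) ** 2
    raise ValueError("k must be -1, 0 or +1")


r = 0.3
for k in (-1, 0, 1):
    chi = chi_from_r(r, k)
    print(f"k = {k:+d}: chi = {chi:.6f}, Phi^2(chi) = {phi_squared(chi, k):.6f}, r^2 = {r * r:.6f}")
```

For $k = +1$ the second point with the same $r$ lies at $\pi - \chi$, which is how $\chi$ removes the degeneracy of the coordinate $r$.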


Let us now take a closer look at the properties of the constant curvature spaces.

Three-dimensional sphere ($k = +1$) It follows from (1.39) that in a three-dimensional
space with positive curvature, the distance element on the surface
of a 2-sphere of radius $\chi$ is

$$dl^2 = a^2\sin^2\chi\,(d\theta^2 + \sin^2\theta\,d\varphi^2). \tag{1.41}$$

This expression is the same as for a sphere of radius $R = a\sin\chi$ in flat three-dimensional
space, and hence we can immediately find the total surface area:

$$S_{2d}(\chi) = 4\pi R^2 = 4\pi a^2\sin^2\chi. \tag{1.42}$$

As the radius $\chi$ increases, the surface area first grows, reaches its maximal value
at $\chi = \pi/2$, and then decreases, vanishing at $\chi = \pi$ (Figure 1.6).

To understand such unusual behavior of a surface area, it is useful to turn to a 
low-dimensional analogy. In this analogy, the surface of the globe plays the role of 
three-dimensional space with constant curvature and the two-dimensional surfaces 
correspond to circles of constant latitude on the globe. Beginning from the north 
pole, corresponding to $\theta = 0$, the circumferences of the circles grow as we move
southward, reach a maximum at the equator, where $\theta = \pi/2$, then decrease below
the equator and vanish at the south pole, $\theta = \pi$. As $\theta$ runs from 0 to $\pi$, it covers the
whole surface of the globe. Similarly, as $\chi$ changes from 0 to $\pi$, it sweeps out the
whole three-dimensional space of constant positive curvature. Because the total 
area of the globe is finite, we expect that the total volume of the three-dimensional 
space with positive curvature is also finite. 


Fig. 1.6. (Vertical axis: $S_{2d}(\chi)$, with maximum $4\pi a^2$.)

In fact, since the physical width of an infinitesimal shell is $dl = a\,d\chi$, the volume
element between two spheres with radii $\chi$ and $\chi + d\chi$ is

$$dV = S_{2d}\,a\,d\chi = 4\pi a^3\sin^2\chi\,d\chi.$$

Therefore, the volume within the sphere of radius $\chi_0$ is

$$V(\chi_0) = 4\pi a^3\int_0^{\chi_0}\sin^2\chi\,d\chi = 2\pi a^3\left(\chi_0 - \tfrac{1}{2}\sin 2\chi_0\right). \tag{1.43}$$

For $\chi_0 \ll 1$, the volume,

$$V(\chi_0) = 4\pi(a\chi_0)^3/3 + \cdots,$$

grows in the same way as in Euclidean space. The total volume, obtained by substituting
$\chi_0 = \pi$ in (1.43), is finite and equal to

$$V = 2\pi^2 a^3. \tag{1.44}$$


The other distinguishing property of a space of constant positive curvature is that 
the sum of the angles of a triangle constructed from geodesics (curves of minimal 
length) is larger than 180 degrees. 
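
The area and volume formulas for the three-dimensional sphere are easy to verify numerically. The following sketch compares the closed-form volume with a midpoint-rule integration of the shell volumes and checks the small-radius and total-volume limits; the function names and the number of integration steps are assumptions of this example.

```python
import math


def sphere_area(chi, a=1.0):
    """Area of a 2-sphere of radius chi in a closed space of curvature radius a."""
    return 4.0 * math.pi * a ** 2 * math.sin(chi) ** 2


def sphere_volume(chi0, a=1.0):
    """Closed-form volume inside radius chi0."""
    return 2.0 * math.pi * a ** 3 * (chi0 - 0.5 * math.sin(2.0 * chi0))


def sphere_volume_numeric(chi0, a=1.0, n=20000):
    """Midpoint-rule sum of the shell volumes S(chi) * a * dchi."""
    dchi = chi0 / n
    return sum(sphere_area((i + 0.5) * dchi, a) * a * dchi for i in range(n))


print("total volume, closed form:", sphere_volume(math.pi))
print("total volume, numeric    :", sphere_volume_numeric(math.pi))
print("2 pi^2                   :", 2.0 * math.pi ** 2)
small = 0.01
print("V / Euclidean volume at chi0 = 0.01:",
      sphere_volume(small) / (4.0 * math.pi * small ** 3 / 3.0))
```

The last ratio approaches unity for small radii, which is the statement that the space looks Euclidean on scales much smaller than the curvature radius.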


Three-dimensional pseudo-sphere ($k = -1$) The metric on the surface of a 2-sphere
of radius $\chi$ in a three-dimensional space of constant negative curvature
is

$$dl^2 = a^2\sinh^2\chi\,(d\theta^2 + \sin^2\theta\,d\varphi^2), \tag{1.45}$$

and the area of the sphere,

$$S_{2d}(\chi) = 4\pi a^2\sinh^2\chi, \tag{1.46}$$

increases exponentially for $\chi \gg 1$. Since the coordinate $\chi$ varies from 0 to $+\infty$,
the total volume of the hyperbolic space is infinite. The sum of angles of a triangle
is less than 180 degrees.


Problem 1.10 Calculate the volume of a sphere with radius $\chi_0$ in a space with
constant negative curvature. 


1.3.2 The Einstein equations and cosmic evolution 


The only way to preserve the homogeneity and isotropy of space and yet incorporate 
time evolution is to allow the curvature scale, characterized by $a$, to be time-dependent.
The scale factor $a(t)$ thus completely describes the time evolution of
a homogeneous, isotropic universe. In relativistic theory, there is no absolute time
and spatial distances are not invariant with respect to coordinate transformations.
Instead, the infinitesimal spacetime interval between events is invariant. There exist,
however, preferred coordinate systems in which the symmetries of the universe are
clearly manifest. In one of the most convenient of such coordinate systems, the
interval takes the form

$$ds^2 = dt^2 - dl_{3d}^2 = dt^2 - a^2(t)\left(\frac{dr^2}{1 - kr^2} + r^2 d\Omega^2\right) \equiv g_{\alpha\beta}\,dx^\alpha dx^\beta, \tag{1.47}$$

where $g_{\alpha\beta}$ is the metric of the spacetime and $x^\alpha \equiv (t, r, \theta, \varphi)$ are the coordinates of
events. We will use the Einstein convention for summation over repeated indices:

$$g_{\alpha\beta}\,dx^\alpha dx^\beta \equiv \sum_{\alpha,\beta} g_{\alpha\beta}\,dx^\alpha dx^\beta.$$

Additionally, we will always choose Greek indices to run from 0 to 3 with 0 reserved
for the time-like coordinate. Latin indices run only over spatial coordinates:
$i, l, \ldots = 1, 2, 3$. The spatial coordinates introduced above are comoving; that is,
every object with zero peculiar velocity has constant coordinates $r$, $\theta$, $\varphi$. Furthermore,
the coordinate $t$ is the proper time measured by a comoving observer. The
distance between two comoving observers at a particular moment of time is

$$\int\sqrt{-ds^2_{t=\text{const}}} \propto a(t)$$

and, therefore, increases or decreases in proportion to the scale factor.
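In General Relativity, the dynamical variables characterizing the gravitational
field are the components of the metric $g_{\alpha\beta}(x^\gamma)$ and they obey the Einstein equations:

$$G^\alpha_\beta \equiv R^\alpha_\beta - \frac{1}{2}\delta^\alpha_\beta R - \Lambda\delta^\alpha_\beta = 8\pi G\,T^\alpha_\beta. \tag{1.48}$$

Here

$$R^\alpha_\beta = g^{\alpha\gamma}\left(\frac{\partial\Gamma^\delta_{\gamma\beta}}{\partial x^\delta} - \frac{\partial\Gamma^\delta_{\gamma\delta}}{\partial x^\beta} + \Gamma^\delta_{\gamma\beta}\Gamma^\sigma_{\delta\sigma} - \Gamma^\sigma_{\gamma\delta}\Gamma^\delta_{\beta\sigma}\right) \tag{1.49}$$

is the Ricci tensor expressed in terms of the inverse metric $g^{\alpha\gamma}$, defined via
$g^{\alpha\gamma}g_{\gamma\beta} = \delta^\alpha_\beta$, and the Christoffel symbols

$$\Gamma^\alpha_{\gamma\beta} = \frac{1}{2}g^{\alpha\delta}\left(\frac{\partial g_{\gamma\delta}}{\partial x^\beta} + \frac{\partial g_{\delta\beta}}{\partial x^\gamma} - \frac{\partial g_{\gamma\beta}}{\partial x^\delta}\right). \tag{1.50}$$

The symbol $\delta^\alpha_\beta$ denotes the unit tensor, equal to 1 when $\alpha = \beta$ and 0 otherwise;
$R = R^\alpha_\alpha$ is the scalar curvature; and $\Lambda = \text{const}$ is the cosmological term. Matter
is incorporated in Einstein’s equations through the energy-momentum tensor, $T^\alpha_\beta$.
(In General Relativity the term “matter” is used for anything not the gravitational
field.) This tensor is symmetric,

$$T^{\alpha\beta} \equiv g^{\beta\delta}T^\alpha_\delta = T^{\beta\alpha}, \tag{1.51}$$

and is (almost unambiguously) determined by the condition that the equations

$$\partial T^{\alpha\beta}/\partial x^\beta = 0 \tag{1.52}$$

must coincide with the equations of motion for matter in Minkowski spacetime. To
generalize to curved spacetime, the equations of motion are modified:

$$T^{\alpha\beta}{}_{;\beta} \equiv \frac{\partial T^{\alpha\beta}}{\partial x^\beta} + \Gamma^\alpha_{\gamma\beta}T^{\gamma\beta} + \Gamma^\beta_{\gamma\beta}T^{\alpha\gamma} = 0, \tag{1.53}$$

where the terms proportional to $\Gamma$ account for the gravitational field. Note that in
General Relativity these equations do not need to be postulated separately. They follow
from the Einstein equations as a consequence of the Bianchi identities satisfied
by the Einstein tensor:

$$G^\alpha_{\beta;\alpha} = 0. \tag{1.54}$$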


On large scales, matter can be approximated as a perfect fluid characterized by
energy density $\varepsilon$, pressure $p$ and 4-velocity $u^\alpha$. Its energy-momentum tensor is

$$T^\alpha_\beta = (\varepsilon + p)u^\alpha u_\beta - p\,\delta^\alpha_\beta, \tag{1.55}$$

where the equation of state $p = p(\varepsilon)$ depends on the properties of matter and must
be specified. For example, if the universe is composed of ultra-relativistic gas, the
equation of state is $p = \varepsilon/3$. In many cosmologically interesting cases $p = w\varepsilon$,
where $w$ is constant.


Problem 1.11 Consider a nonrelativistic, dust-like perfect fluid ($u^0 \approx 1$, $u^i \ll 1$,
$p \ll \varepsilon$) in a flat spacetime. Verify that the equations $T^{\alpha\beta}{}_{,\beta} = 0$ are equivalent to
the mass conservation law plus the Euler equations of motion.


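Another important example of matter is a classical scalar field $\varphi$ with potential
$V(\varphi)$. In this case, the energy-momentum tensor is given by the expression

$$T^\alpha_\beta = \varphi^{,\alpha}\varphi_{,\beta} - \left(\frac{1}{2}\varphi^{,\gamma}\varphi_{,\gamma} - V(\varphi)\right)\delta^\alpha_\beta, \tag{1.56}$$

where

$$\varphi_{,\beta} \equiv \frac{\partial\varphi}{\partial x^\beta}, \qquad \varphi^{,\alpha} \equiv g^{\alpha\gamma}\varphi_{,\gamma}.$$


Problem 1.12 Show that the equations of motion for the scalar field,

$$\varphi^{;\alpha}{}_{;\alpha} + \frac{\partial V}{\partial\varphi} = 0, \tag{1.57}$$

follow from $T^\alpha_{\beta;\alpha} = 0$.


If $\varphi^{,\gamma}\varphi_{,\gamma} > 0$, then the energy-momentum tensor for a scalar field can be rewritten
in the form of a perfect fluid (1.55) by defining

$$\varepsilon \equiv \frac{1}{2}\varphi^{,\gamma}\varphi_{,\gamma} + V(\varphi), \qquad p \equiv \frac{1}{2}\varphi^{,\gamma}\varphi_{,\gamma} - V(\varphi), \qquad u^\alpha \equiv \varphi^{,\alpha}/\sqrt{\varphi^{,\gamma}\varphi_{,\gamma}}. \tag{1.58}$$

In particular, assuming that the field is homogeneous ($\partial\varphi/\partial x^i = 0$), we have

$$\varepsilon = \frac{1}{2}\dot\varphi^2 + V(\varphi), \qquad p = \frac{1}{2}\dot\varphi^2 - V(\varphi). \tag{1.59}$$

For a scalar field, the ratio $w = p/\varepsilon$ is, in general, time-dependent. Additionally,
$w$ is bounded from below by $-1$ for any positive potential $V$ and the weak energy
dominance condition, $\varepsilon + p \ge 0$, is satisfied. However, the strong energy dominance
condition, $\varepsilon + 3p \ge 0$, can easily be violated by a scalar field. For example,
if a potential $V(\varphi)$ has a local minimum at some point $\varphi_0$, then $\varphi(t) = \varphi_0$ is a
solution of the scalar field equations, for which

$$p = -\varepsilon = -V(\varphi_0). \tag{1.60}$$

As far as Einstein’s equations are concerned, the corresponding energy-momentum
tensor,

$$T^\alpha_\beta = V(\varphi_0)\,\delta^\alpha_\beta, \tag{1.61}$$

imitates a cosmological term

$$\Lambda = 8\pi G\,V(\varphi_0). \tag{1.62}$$

The cosmological term can therefore always be interpreted as the contribution of
vacuum energy to the Einstein equations and from now on we include it in the
energy-momentum tensor of matter and set $\Lambda = 0$ in (1.48).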
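
The behavior of a homogeneous scalar field as an effective fluid can be illustrated with a short sketch. Given the field velocity and the value of the potential, the code below returns the energy density, the pressure and their ratio, and evaluates the two energy conditions discussed above. The function names and sample values are hypothetical and serve only as an illustration.

```python
def scalar_field_fluid(phi_dot, potential):
    """Energy density and pressure of a homogeneous scalar field."""
    kinetic = 0.5 * phi_dot ** 2
    return kinetic + potential, kinetic - potential


def equation_of_state(phi_dot, potential):
    """Ratio w = p / eps; assumes a positive energy density."""
    eps, p = scalar_field_fluid(phi_dot, potential)
    return p / eps


# Hypothetical sample values: (field velocity, potential).
for phi_dot, potential in ((0.0, 2.0), (0.5, 1.0), (3.0, 1.0)):
    eps, p = scalar_field_fluid(phi_dot, potential)
    print(f"phi_dot = {phi_dot}, V = {potential}: w = {p / eps:+.3f}, "
          f"eps + p = {eps + p:.3f}, eps + 3p = {eps + 3.0 * p:.3f}")
```

A field at rest gives $w = -1$, and whenever the potential exceeds $\dot\varphi^2$ the combination $\varepsilon + 3p$ becomes negative while $\varepsilon + p$ stays nonnegative.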


1.3.3 Friedmann equations 


How are the Newtonian equations of cosmological evolution (1.12), (1.15) and 
(1.18) modified when matter is relativistic? In principle, to answer this question we 
must simply substitute the metric (1.47) and energy-momentum tensor (1.55) into 
the Einstein equations (1.48). The resulting equations are the Friedmann equations 
and they determine the two unknown functions $a(t)$ and $\varepsilon(t)$. However, rather than
starting with this formal derivation, it is instructive to explain how the nonrelativistic
equations (1.12) and (1.15) must be modified.

If the pressure $p$ within an expanding sphere of volume $V$ is significant, then
the total energy, $E = \varepsilon V$, is no longer conserved because the pressure does work,
$-p\,dV$. According to the first law of thermodynamics, this work must be equal to
the change in the total energy:

$$dE = -p\,dV. \tag{1.63}$$

Since $V \propto a^3$, we can rewrite this conservation law as

$$d\varepsilon = -3(\varepsilon + p)\,d\ln a \tag{1.64}$$

or, equivalently,

$$\dot\varepsilon = -3H(\varepsilon + p). \tag{1.65}$$

This relation is the new version of (1.12) and it turns out to be the energy conservation
equation, $T^\alpha_{0;\alpha} = 0$, in an isotropic, homogeneous universe.
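
For an equation of state $p = w\varepsilon$ with constant $w$, the conservation law (1.64) integrates to the power law $\varepsilon \propto a^{-3(1+w)}$. The sketch below compares this closed form with a direct Runge-Kutta integration in $\ln a$. The normalization $\varepsilon = \varepsilon_0$ at $a = 1$, the function names and the step count are assumptions of this example.

```python
import math


def density_closed_form(a, w, eps0=1.0):
    """Power-law solution eps = eps0 * a^(-3(1+w)), normalized at a = 1."""
    return eps0 * a ** (-3.0 * (1.0 + w))


def density_integrated(a, w, eps0=1.0, steps=1000):
    """Runge-Kutta integration of d(eps)/d(ln a) = -3 (1 + w) eps from a = 1."""
    rate = -3.0 * (1.0 + w)
    h = math.log(a) / steps
    eps = eps0
    for _ in range(steps):
        k1 = rate * eps
        k2 = rate * (eps + 0.5 * h * k1)
        k3 = rate * (eps + 0.5 * h * k2)
        k4 = rate * (eps + h * k3)
        eps += h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return eps


for label, w in (("dust", 0.0), ("radiation", 1.0 / 3.0), ("vacuum", -1.0)):
    print(f"{label:9s} w = {w:+.3f}: eps(a=10) = {density_integrated(10.0, w):.6e} "
          f"(closed form {density_closed_form(10.0, w):.6e})")
```

Dust dilutes as $a^{-3}$, radiation as $a^{-4}$, and for $w = -1$ the energy density stays constant.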

The acceleration equation is also modified for matter with nonnegligible pressure
since, according to General Relativity, the strength of the gravitational field depends
not only on the energy density but also on the pressure. Equation (1.15) becomes
the first Friedmann equation:

$$\ddot a = -\frac{4\pi}{3}G(\varepsilon + 3p)a. \tag{1.66}$$

The real justification for the form of the pressure contribution is that the acceleration
equation (1.66) follows from any diagonal spatial component of the Einstein
equations. Multiplying (1.66) by $\dot a$, using (1.65) to express $p$ in terms of $\varepsilon$, $\dot\varepsilon$ and
$H$, and integrating, we obtain the second Friedmann equation:

$$H^2 + \frac{k}{a^2} = \frac{8\pi G}{3}\varepsilon. \tag{1.67}$$

This looks like the Newtonian equation (1.18) with $k = -2E$, though (1.67) applies
for an arbitrary equation of state. However, $k$ is not simply a constant of integration:
the 0-0 Einstein equation tells us that it is exactly the curvature introduced
before, that is, $k = \pm 1$ or 0. For $k = \pm 1$, the magnitude of the scale factor $a$ has a
geometrical interpretation as the radius of curvature.
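
That the second Friedmann equation is a first integral of the acceleration and conservation equations can be seen numerically. The sketch below evolves $a$, $\dot a$ and $\varepsilon$ for a constant equation of state with a Runge-Kutta scheme and monitors the combination that should stay equal to $k$. Units with $8\pi G/3 = 1$, the initial data (chosen so that $k = +1$) and all function names are assumptions of this example.

```python
def friedmann_rhs(state, w):
    """Time derivatives of (a, adot, eps) in units where 8 pi G / 3 = 1."""
    a, a_dot, eps = state
    p = w * eps
    a_ddot = -0.5 * (eps + 3.0 * p) * a          # acceleration equation
    eps_dot = -3.0 * (a_dot / a) * (eps + p)     # energy conservation
    return (a_dot, a_ddot, eps_dot)


def rk4_step(state, w, h):
    def shifted(base, slope, factor):
        return tuple(b + factor * s for b, s in zip(base, slope))
    k1 = friedmann_rhs(state, w)
    k2 = friedmann_rhs(shifted(state, k1, 0.5 * h), w)
    k3 = friedmann_rhs(shifted(state, k2, 0.5 * h), w)
    k4 = friedmann_rhs(shifted(state, k3, h), w)
    return tuple(s + h * (c1 + 2.0 * c2 + 2.0 * c3 + c4) / 6.0
                 for s, c1, c2, c3, c4 in zip(state, k1, k2, k3, k4))


def evolve(state, w, t_end, steps=2000):
    h = t_end / steps
    for _ in range(steps):
        state = rk4_step(state, w, h)
    return state


def curvature_constant(state):
    """eps * a^2 - adot^2, which equals k when 8 pi G / 3 = 1."""
    a, a_dot, eps = state
    return eps * a ** 2 - a_dot ** 2


initial = (1.0, 1.0, 2.0)                # a, adot, eps: gives k = +1
final = evolve(initial, 1.0 / 3.0, 0.5)  # radiation, w = 1/3
print("k at start:", curvature_constant(initial))
print("k at end  :", curvature_constant(final))
```

The conserved combination does not change during the evolution, which is the numerical counterpart of $k$ appearing as a constant in (1.67).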

Thus, in General Relativity, the value of the cosmological parameter, $\Omega \equiv \varepsilon/\varepsilon^{cr}$,
determines the geometry. If $\Omega > 1$, the universe is closed and has the geometry
of a three-dimensional sphere ($k = +1$); $\Omega = 1$ corresponds to a flat universe
($k = 0$); and in the case of $\Omega < 1$, the universe is open and has hyperbolic geometry
($k = -1$).

The combination of (1.67) and either the conservation law (1.65) or the acceleration
equation (1.66), supplemented by the equation of state $p = p(\varepsilon)$, forms a
complete system of equations that determines the two unknown functions $a(t)$ and
$\varepsilon(t)$. The solutions, and hence the future of the universe, depend not only on the
geometry but also on the equation of state.


Problem 1.13 From (1.65) and (1.67), derive the following useful relation:

$$\dot H = -4\pi G(\varepsilon + p) + \frac{k}{a^2}. \tag{1.68}$$


Problem 1.14 Show that, for $p > -\varepsilon/3$, a closed universe recollapses after reaching
a maximal radius while flat and open universes continue to expand forever.
Verify that the spatial curvature term in (1.67), $k/a^2$, can be neglected as $a \to 0$
and give a physical interpretation of this result. Analyze the behavior of the scale
factor for the case $-\varepsilon/3 \ge p \ge -\varepsilon$.


To conclude this section, let us reiterate the most important distinctions be- 
tween the Newtonian and relativistic treatments of a homogeneous, isotropic uni- 
verse. First, the Newtonian approach is incomplete: it is only valid (with justi- 
fication from General Relativity) for nearly pressureless matter on small scales, 
where the relative velocities due to expansion are small compared to the speed 
of light. In Newtonian cosmology, the spatial geometry is always flat and, conse- 
quently, the scale factor has no geometrical interpretation. General Relativity, by 
contrast, provides a complete, self-consistent theory which allows us to describe 
relativistic matter with any equation of state. This theory is applicable on arbi- 
trarily large scales. The matter content determines the geometry of the universe 
and, if $k = \pm 1$, the scale factor has a geometrical interpretation as the radius of
curvature. 


1.3.4 Conformal time and relativistic solutions 


To find particular solutions of the Friedmann equations it is often convenient to 
replace the physical time $t$ with the conformal time $\eta$, defined as

$$\eta \equiv \int\frac{dt}{a(t)}, \tag{1.69}$$

so that $dt = a(\eta)\,d\eta$. Equation (1.67) can then be rewritten as

$$a'^2 + ka^2 = \frac{8\pi G}{3}\varepsilon a^4, \tag{1.70}$$

where prime denotes the derivative with respect to $\eta$. Differentiating with respect
to $\eta$ and using (1.64), we obtain

$$a'' + ka = \frac{4\pi G}{3}(\varepsilon - 3p)a^3. \tag{1.71}$$

This last equation, which corresponds to the trace of the Einstein equations, is useful
for finding analytic solutions for a universe filled by dust and radiation. 

In the case of radiation, $p = \varepsilon/3$, the expression on the right hand side of (1.71)
vanishes and the equation reduces to

$$a'' + ka = 0. \tag{1.72}$$

This is easily integrated and the result is

$$a(\eta) = a_m\cdot\begin{cases}\sinh\eta, & k = -1;\\ \eta, & k = 0;\\ \sin\eta, & k = +1.\end{cases} \tag{1.73}$$

Here $a_m$ is one constant of integration and the other has been fixed by requiring
$a(\eta = 0) = 0$. The physical time $t$ is expressed in terms of $\eta$ by integrating the
relation $dt = a\,d\eta$:

$$t = a_m\cdot\begin{cases}(\cosh\eta - 1), & k = -1;\\ \eta^2/2, & k = 0;\\ (1 - \cos\eta), & k = +1.\end{cases} \tag{1.74}$$

It follows that in the most interesting case of a flat radiation-dominated universe,
the scale factor is proportional to the square root of the physical time, $a \propto \sqrt{t}$, and
hence $H = 1/2t$. Substituting this into (1.67), we obtain

$$\varepsilon_r = \frac{3}{32\pi Gt^2} \propto a^{-4}. \tag{1.75}$$

Alternatively, the energy conservation equation (1.64) for radiation takes the form

$$d\varepsilon_r = -4\varepsilon_r\,d\ln a, \tag{1.76}$$

also implying that $\varepsilon_r \propto a^{-4}$.
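
The parametric solutions for a radiation-dominated universe are straightforward to tabulate. The following sketch implements $a(\eta)$ and $t(\eta)$ for the three values of $k$ and, for the flat case, checks that $a \propto \sqrt{t}$ and $Ht = 1/2$. The function names and the sample values of $\eta$ and $a_m$ are assumptions of this example.

```python
import math


def a_radiation(eta, k, a_m=1.0):
    """Scale factor of a radiation-dominated universe as a function of eta."""
    if k == -1:
        return a_m * math.sinh(eta)
    if k == 0:
        return a_m * eta
    if k == 1:
        return a_m * math.sin(eta)
    raise ValueError("k must be -1, 0 or +1")


def t_radiation(eta, k, a_m=1.0):
    """Physical time obtained by integrating dt = a d(eta) from eta = 0."""
    if k == -1:
        return a_m * (math.cosh(eta) - 1.0)
    if k == 0:
        return a_m * eta ** 2 / 2.0
    if k == 1:
        return a_m * (1.0 - math.cos(eta))
    raise ValueError("k must be -1, 0 or +1")


def hubble_radiation(eta, k, a_m=1.0):
    """H = (da/dt)/a = a'/a^2, with the prime denoting d/d(eta)."""
    a_prime = a_m * {-1: math.cosh(eta), 0: 1.0, 1: math.cos(eta)}[k]
    return a_prime / a_radiation(eta, k, a_m) ** 2


eta, a_m = 0.7, 2.0
t = t_radiation(eta, 0, a_m)
print("flat case: a =", a_radiation(eta, 0, a_m), " sqrt(2 a_m t) =", math.sqrt(2.0 * a_m * t))
print("flat case: H * t =", hubble_radiation(eta, 0, a_m) * t)
```

In the closed case the same functions show the scale factor growing until $\eta = \pi/2$ and returning to zero at $\eta = \pi$.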

Problem 1.15 Find $H(\eta)$ and $\Omega(\eta)$ in open and closed radiation-dominated universes
and express the current age of the universe $t_0$ in terms of $H_0$ and $\Omega_0$. Analyze
the result for $\Omega_0 \ll 1$ and give its physical interpretation.


Problem 1.16 For dust, $p = 0$, the expression on the right hand side of (1.71) is
constant and solutions of this equation can easily be found. Verify that

$$a(\eta) = a_m\cdot\begin{cases}(\cosh\eta - 1), & k = -1;\\ \eta^2, & k = 0;\\ (1 - \cos\eta), & k = +1.\end{cases} \tag{1.77}$$

For each case, compute $H(\eta)$ and $\Omega(\eta)$ and express the age of the universe in terms
of $H_0$ and $\Omega_0$. Show that in the limit $\Omega_0 \to 0$, we have $t_0 = 1/H_0$, in agreement
with the Newtonian estimate obtained by ignoring gravity. (Hint: Use (1.70) to fix
one of the constants of integration.)


The range of conformal time $\eta$ in flat and open universes is semi-infinite,
$+\infty > \eta > 0$, regardless of whether the universe is dominated by radiation or matter. For
a closed universe, $\eta$ is bounded: $\pi > \eta > 0$ and $2\pi > \eta > 0$ in the radiation- and
matter-dominated universes respectively.

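Finally, we consider the important case of a flat universe with a mixture of matter
(dust) and radiation. The energy density of matter decreases as $1/a^3$ while that of
radiation decays as $1/a^4$. Therefore, we have

\[ \varepsilon = \varepsilon_m + \varepsilon_r = \frac{\varepsilon_{eq}}{2}\left(\left(\frac{a_{eq}}{a}\right)^3 + \left(\frac{a_{eq}}{a}\right)^4\right), \tag{1.78} \]

where $a_{eq}$ is the value of the scale factor at matter-radiation equality, when
$\varepsilon_m = \varepsilon_r$. Equation (1.71) now becomes

\[ a'' = \frac{2\pi G}{3}\,\varepsilon_{eq} a_{eq}^3 \tag{1.79} \]

and has a simple solution:

\[ a(\eta) = \frac{\pi G}{3}\,\varepsilon_{eq} a_{eq}^3\,\eta^2 + C\eta. \tag{1.80} \]

Again, we have fixed one of the two constants of integration by imposing the
condition $a(\eta = 0) = 0$. Substituting (1.78) and (1.80) into (1.70), we find the
other constant of integration:

\[ C = \left(4\pi G\varepsilon_{eq} a_{eq}^4/3\right)^{1/2}. \]

Solution (1.80) is then

\[ a(\eta) = a_{eq}\left(\left(\frac{\eta}{\eta_*}\right)^2 + 2\left(\frac{\eta}{\eta_*}\right)\right), \tag{1.81} \]

where

\[ \eta_* = \left(\pi G\varepsilon_{eq} a_{eq}^2/3\right)^{-1/2} = \eta_{eq}/(\sqrt{2} - 1) \tag{1.82} \]

has been introduced to simplify the expression. (The relation between $\eta_*$ and $\eta_{eq}$
immediately follows from $a(\eta_{eq}) = a_{eq}$.) For $\eta \ll \eta_{eq}$, radiation dominates and
$a \propto \eta$. As the universe expands, the energy density of radiation decreases faster
than that of dust. Hence, for $\eta \gg \eta_{eq}$, dust takes over and we have $a \propto \eta^2$.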
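As a consistency check on (1.78)-(1.82), one can verify numerically that the solution
satisfies the constraint (1.70) with $k = 0$ and passes through $a_{eq}$ at
$\eta_{eq} = (\sqrt{2} - 1)\eta_*$. The Python sketch below is an illustration only; the
function names, the sample parameter values and the units with $G = 1$ are assumptions.

```python
import math

G = 1.0  # units with G = 1 are an assumption of this sketch


def eta_star(eps_eq, a_eq):
    """Conformal time scale eta_* defined in (1.82)."""
    return (math.pi * G * eps_eq * a_eq**2 / 3.0) ** -0.5


def scale_factor(eta, eps_eq, a_eq):
    """Scale factor (1.81) for a flat universe with dust and radiation."""
    x = eta / eta_star(eps_eq, a_eq)
    return a_eq * (x**2 + 2.0 * x)


def scale_factor_prime(eta, eps_eq, a_eq):
    """Derivative of (1.81) with respect to conformal time."""
    es = eta_star(eps_eq, a_eq)
    return a_eq * (2.0 * eta / es**2 + 2.0 / es)


def energy_density(a, eps_eq, a_eq):
    """Total energy density (1.78)."""
    return 0.5 * eps_eq * ((a_eq / a) ** 3 + (a_eq / a) ** 4)


def friedmann_residual(eta, eps_eq, a_eq):
    """Relative mismatch between the two sides of (1.70) for k = 0."""
    a = scale_factor(eta, eps_eq, a_eq)
    lhs = scale_factor_prime(eta, eps_eq, a_eq) ** 2
    rhs = 8.0 * math.pi * G / 3.0 * energy_density(a, eps_eq, a_eq) * a**4
    return abs(lhs - rhs) / rhs


if __name__ == "__main__":
    eps_eq, a_eq = 0.7, 1.3  # arbitrary sample values
    es = eta_star(eps_eq, a_eq)
    for eta in (0.01 * es, es, 50.0 * es):
        print(eta / es, friedmann_residual(eta, eps_eq, a_eq))
    print(scale_factor((math.sqrt(2.0) - 1.0) * es, eps_eq, a_eq) / a_eq)
```

The residuals should vanish up to rounding for any $\eta > 0$, and the last printed
ratio should equal one.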
Problem 1.17 Verify that, for a nonflat universe with a mixture of matter and
radiation, one has

\[ a(\eta) = a_m \cdot \begin{cases} (\eta_*\sinh\eta + \cosh\eta - 1), & k = -1; \\ (\eta_*\sin\eta + 1 - \cos\eta), & k = +1. \end{cases} \tag{1.83} \]


Problem 1.18 Consider a closed universe filled with matter whose equation of
state is $w = p/\varepsilon$, where $w$ is constant. Verify that the scale factor is then

\[ a(\eta) = a_m\left(\sin\left(\frac{1 + 3w}{2}\,\eta + C\right)\right)^{2/(1+3w)}, \tag{1.84} \]

where $C$ is a constant of integration. Analyze the behavior of the scale factor for
$w = -1, -1/2, -1/3, 0$ and $+1/3$. Find the corresponding solutions for flat and
open universes.





1.3.5 Milne universe 


Let us consider an open universe with $k = -1$ in the limit of vanishing energy
density, $\varepsilon \to 0$. In this case, (1.67) simplifies to

\[ \dot a^2 = 1 \]

and has a solution, $a = t$. The metric then takes the form

\[ ds^2 = dt^2 - t^2\left(d\chi^2 + \sinh^2\chi\, d\Omega^2\right), \tag{1.85} \]

and describes a spacetime known as a Milne universe. One might naturally expect
that the solution of the Einstein equations for an isotropic space without matter
must be Minkowski spacetime. Indeed, the Milne universe is simply a piece of
Minkowski spacetime described in expanding coordinates. To prove this, we begin
with the Minkowski metric,

\[ ds^2 = d\tau^2 - dr^2 - r^2 d\Omega^2. \tag{1.86} \]

Replacing the Minkowski coordinates $\tau$ and $r$ by the new coordinates $t$ and $\chi$,
defined via

\[ \tau = t\cosh\chi, \qquad r = t\sinh\chi, \tag{1.87} \]

we find that

\[ d\tau^2 - dr^2 = dt^2 - t^2 d\chi^2, \]

and hence the Minkowski metric reduces to (1.85). A particle with a given comoving
coordinate $\chi$ moves with constant velocity

\[ |v| = r/\tau = \tanh\chi < 1 \tag{1.88} \]

in Minkowski space and its proper time, $\sqrt{1 - |v|^2}\,\tau$, is equal to the cosmological
time $t$. To find the hypersurfaces of constant proper time $t$, we note that

\[ t^2 = \tau^2 - r^2. \tag{1.89} \]

The hypersurface $t = 0$ coincides with the forward light cone; the surfaces of
constant $t > 0$ are hyperboloids in Minkowski coordinates, all located within the
forward light cone. Hence, the Milne coordinates cover only one quarter of Minkowski
spacetime (Figure 1.7).

[Fig. 1.7. Curves labeled $\chi = \mathrm{const}$ and $t = \mathrm{const}$.]
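The intermediate step in the reduction of (1.86) to (1.85) can be written out
explicitly. The short derivation below is added for convenience and uses only the
definitions (1.87):

```latex
\begin{align*}
d\tau &= \cosh\chi\, dt + t\sinh\chi\, d\chi, \qquad
dr = \sinh\chi\, dt + t\cosh\chi\, d\chi, \\
d\tau^2 - dr^2 &= (\cosh^2\chi - \sinh^2\chi)\, dt^2
  + 2t\,(\cosh\chi\sinh\chi - \sinh\chi\cosh\chi)\, dt\, d\chi
  + t^2(\sinh^2\chi - \cosh^2\chi)\, d\chi^2 \\
&= dt^2 - t^2 d\chi^2, \\
r^2 d\Omega^2 &= t^2\sinh^2\chi\, d\Omega^2 .
\end{align*}
```

Substituting these expressions into (1.86) reproduces (1.85). The same definitions
give $\tau^2 - r^2 = t^2(\cosh^2\chi - \sinh^2\chi) = t^2$, which is (1.89).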

Despite its obvious deficiencies as a practical model, the Milne universe does
illustrate some useful points. First, it shows the similarities and differences between
an explosion (the popular misconception of the “big bang”) and Hubble expansion.
The Milne universe has a center. This is apparent from the fact that the Milne
coordinates cover only one particular quarter of Minkowski spacetime. The curved
Friedmann universe has no center. Second, the Milne universe reveals the subtleties
in the physical interpretation of recessional velocity. If the recessional velocity of
a particle were defined as $|u| = r/t = \sinh\chi$, it would exceed the speed of light
for $\chi > 1$. Of course, there can be no contradiction with the principles of Special
Relativity and we know that the particle is traveling on a physically allowed,
time-like world-line. Special Relativity says that the speed measured using rulers
and clocks of the same inertial coordinate system never exceeds the speed of light.
In the definition of $|u|$, however, we used the distance measured in the Minkowski
coordinate system and the proper time of the moving particle. This corresponds
to the spatial part of the 4-velocity, which can be arbitrarily large. The Hubble
velocity in a Milne universe is also not bounded when defined in the usual way:
$v_H = \dot a\chi = \chi$. Only $|v| = \tanh\chi$ is well defined. Although both $|u|$ and $v_H$ are
approximately equal to $|v|$ for $\chi \ll 1$, for $\chi > 1$ they are very different and can
have no invariant meaning. In curved spacetime, the situation is even more complicated.
The inertial coordinate system can be introduced only locally, on scales much
smaller than the four-dimensional curvature scale, roughly $1/H$. Hence, the relative
Minkowski velocity, the quantity which can never exceed the speed of light, is only
defined for particles whose separation is much less than $1/H$. Any definition of
relative velocities at distances larger than the curvature scale, where the Hubble law
predicts velocities which exceed unity, cannot have an invariant meaning. These
remarks may be helpful in clarifying the notion of “superluminal expansion,” a
confusing term sometimes used in the literature to describe inflationary expansion.

The Milne solution is also useful as an illustration of the difference between
3-curvature and 4-curvature. A “spatially flat” universe ($k = 0$) generically has
nonzero 4-curvature. For example, in the case of a dust-dominated universe with
$\Omega = 1$, space is nonempty and the Riemann tensor is nonzero. The Milne universe
is a complementary example with nonzero spatial curvature ($k = -1$) but zero
4-curvature. Milne coordinates correspond to foliating the locally flat spacetime
with spatially curved homogeneous three-dimensional hypersurfaces. Hence, whenever
the term “flat” is used in cosmology, it is important to distinguish between
3-curvature and 4-curvature.

Generally, one does not have a choice of foliation if it is to respect the homogene- 
ity and isotropy of space. In particular, if the energy density is changing with time, 
the appropriate foliation is hypersurfaces of constant energy density. This choice is 
unique and has invariant physical meaning. Empty space, however, possesses extra 
time-translational invariance, so any space-like hypersurface has uniform “energy 
density” equal to zero. The other example of a homogeneous and isotropic space- 
time with extra time-translational invariance is de Sitter space. In the next section 
we will see that de Sitter space can be covered by three-dimensional hypersurfaces 
of constant curvature with open, flat and closed geometry. 


1.3.6 De Sitter universe 


The de Sitter universe is a spacetime with positive constant 4-curvature that is
homogeneous and isotropic in both space and time. Hence, it possesses the largest
possible symmetry group, as large as the symmetry group of Minkowski spacetime
(ten parameters in the four-dimensional case). In this book, we pay special attention
to the de Sitter universe because it plays a central role in understanding the basic
properties of inflation. In fact, in most scenarios, inflation is nothing more than a
de Sitter stage with slightly broken time-translational symmetry.

To find the metric of the de Sitter universe, we use three different approaches 
which illustrate different mathematical aspects of this spacetime. First, we obtain the 
de Sitter metric in a way similar to that discussed in Section 1.3.1, namely, as a result 
of embedding a constant curvature surface in a higher-dimensional flat spacetime. 
For the sake of simplicity, we perform all calculations for two-dimensional surfaces. 
The generalization to higher dimensions is straightforward. As a second approach, 
we analytically continue metric (1.39), describing a homogeneous, isotropic three- 
dimensional space of constant positive curvature with Euclidean signature, to obtain 
a constant curvature space with Lorentzian signature. Finally, we obtain de Sitter 
spacetime as a solution to the Friedmann equations with positive cosmological 
constant. 


De Sitter universe as a constant curvature surface embedded in Minkowski spacetime
(two-dimensional case) Let us consider a hyperboloid

\[ -z^2 + x^2 + y^2 = H_\Lambda^{-2}, \tag{1.90} \]

embedded in three-dimensional Minkowski space with the metric

\[ ds^2 = dz^2 - dx^2 - dy^2. \tag{1.91} \]

This hyperboloid has positive curvature and lies entirely outside the light cone
(Figure 1.8). Therefore, the induced metric has Lorentzian signature. (We noted
in Problem 1.8 that Lobachevski space can also be embedded in a space with
Lorentzian signature. However, Lobachevski space corresponds to a hyperbolic
surface lying within the light cone and has an induced metric with Euclidean
signature.) To parameterize the surface of the hyperboloid, we can use $x$ and $y$
coordinates. The metric of the hyperboloid can then be written as

\[ ds^2 = \frac{(x\,dx + y\,dy)^2}{x^2 + y^2 - H_\Lambda^{-2}} - dx^2 - dy^2, \tag{1.92} \]

where $x^2 + y^2 > H_\Lambda^{-2}$. This is the metric of a two-dimensional de Sitter spacetime
in $x$, $y$ coordinates. As with the cases considered in Section 1.3.1, it is more
convenient to use coordinates in which the symmetries of the spacetime are more
explicit. The first choice is $t$, $\chi$ coordinates related to $x$, $y$ via

\[ x = H_\Lambda^{-1}\cosh(H_\Lambda t)\cos\chi, \qquad y = H_\Lambda^{-1}\cosh(H_\Lambda t)\sin\chi. \tag{1.93} \]

[Fig. 1.8. Curves labeled $t = \mathrm{const}$ and $\chi = \mathrm{const}$.]


These coordinates cover the entire hyperboloid for $+\infty > t > -\infty$ and
$2\pi \ge \chi \ge 0$ (Figure 1.8), and metric (1.92) becomes

\[ ds^2 = dt^2 - H_\Lambda^{-2}\cosh^2(H_\Lambda t)\, d\chi^2. \tag{1.94} \]

In the four-dimensional case, this form of the metric corresponds to a closed universe
with $k = +1$.

Another choice of coordinates, namely,

\[ x = H_\Lambda^{-1}\cosh(H_\Lambda\tilde t), \qquad y = H_\Lambda^{-1}\sinh(H_\Lambda\tilde t)\sinh\tilde\chi, \tag{1.95} \]

reduces (1.92) to a form corresponding to an open de Sitter universe:

\[ ds^2 = d\tilde t^2 - H_\Lambda^{-2}\sinh^2(H_\Lambda\tilde t)\, d\tilde\chi^2. \tag{1.96} \]

The range of these coordinates is $+\infty > \tilde t \ge 0$ and $+\infty > \tilde\chi > -\infty$, covering
only the part of de Sitter spacetime where $x \ge H_\Lambda^{-1}$ and $z \ge 0$ (Figure 1.9). Moreover,
the coordinates are singular at $\tilde t = 0$.

[Fig. 1.9. Curves labeled $\tilde\chi = \mathrm{const}$.]

Finally, we consider the coordinate system defined via

\[ x = H_\Lambda^{-1}\left[\cosh(H_\Lambda\bar t) - \frac{1}{2}\exp(H_\Lambda\bar t)\,\bar\chi^2\right], \qquad y = H_\Lambda^{-1}\exp(H_\Lambda\bar t)\,\bar\chi, \tag{1.97} \]

where $+\infty > \bar t > -\infty$ and $+\infty > \bar\chi > -\infty$. Expressing $z$ in terms of $\bar t$, $\bar\chi$, one
finds that only the half of the hyperboloid located at $x + z > 0$ is covered by these
“flat” coordinates (Figure 1.10). The metric becomes

\[ ds^2 = d\bar t^2 - H_\Lambda^{-2}\exp(2H_\Lambda\bar t)\, d\bar\chi^2. \tag{1.98} \]


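The relation between the different coordinate systems in the regions where they
overlap can be obtained by comparing (1.93), (1.95) and (1.97):

\[ \begin{aligned}
\cosh(H_\Lambda t)\cos\chi &= \cosh(H_\Lambda\tilde t) = \cosh(H_\Lambda\bar t) - \tfrac{1}{2}\exp(H_\Lambda\bar t)\,\bar\chi^2, \\
\cosh(H_\Lambda t)\sin\chi &= \sinh(H_\Lambda\tilde t)\sinh\tilde\chi = \exp(H_\Lambda\bar t)\,\bar\chi.
\end{aligned} \tag{1.99} \]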
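The three parameterizations can be checked numerically against the hyperboloid
equation (1.90). In the Python sketch below, the expressions for $z$ are not quoted
from the text; they are obtained here by solving (1.90) for each coordinate choice, and
all function names are assumptions made for illustration. The last function uses (1.99)
to map a point given in flat coordinates to closed coordinates.

```python
import math


def closed_embedding(t, chi, H=1.0):
    """Closed slicing (1.93); z = sinh(H t)/H follows from (1.90)."""
    x = math.cosh(H * t) * math.cos(chi) / H
    y = math.cosh(H * t) * math.sin(chi) / H
    z = math.sinh(H * t) / H
    return x, y, z


def open_embedding(t, chi, H=1.0):
    """Open slicing (1.95), taking the z >= 0 branch of (1.90)."""
    x = math.cosh(H * t) / H
    y = math.sinh(H * t) * math.sinh(chi) / H
    z = math.sinh(H * t) * math.cosh(chi) / H
    return x, y, z


def flat_embedding(t, chi, H=1.0):
    """Flat slicing (1.97); z is fixed by (1.90) so that x + z = exp(H t)/H."""
    x = (math.cosh(H * t) - 0.5 * math.exp(H * t) * chi**2) / H
    y = math.exp(H * t) * chi / H
    z = (math.sinh(H * t) + 0.5 * math.exp(H * t) * chi**2) / H
    return x, y, z


def hyperboloid_residual(point, H=1.0):
    """Value of -z^2 + x^2 + y^2 - H^-2, which vanishes on the surface (1.90)."""
    x, y, z = point
    return -z * z + x * x + y * y - H**-2


def flat_to_closed(t_bar, chi_bar, H=1.0):
    """Closed coordinates (t, chi) of the point with flat coordinates (t_bar, chi_bar)."""
    x, y, z = flat_embedding(t_bar, chi_bar, H)
    t = math.asinh(H * z) / H
    chi = math.atan2(y, x) % (2.0 * math.pi)
    return t, chi


if __name__ == "__main__":
    H = 2.0
    for point in (closed_embedding(0.7, 1.1, H),
                  open_embedding(0.7, -1.3, H),
                  flat_embedding(-0.4, 0.9, H)):
        print(hyperboloid_residual(point, H))
```

All three residuals should vanish up to rounding. The open coordinates always give
$x \ge H_\Lambda^{-1}$ and $z \ge 0$, and the flat coordinates always give $x + z > 0$, in line with the
coverage described above.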


[Fig. 1.10.]

De Sitter spacetime via analytical continuation (three-dimensional case) Since a
de Sitter universe is a spacetime of constant positive curvature with Lorentzian
signature, it can be obtained by analytical continuation of a metric describing a
positive curvature space with Euclidean signature. To see how analytical continuation
changes the signature of the metric, let us consider (1.39) describing a closed
universe ($k = +1$). After the change of variables,

\[ a \to H_\Lambda^{-1}, \quad \chi \to H_\Lambda t, \quad \theta \to \chi, \quad \varphi \to \theta, \]

it is recast as

\[ ds^2 = -dl^2_{3d} = -dt^2 - H_\Lambda^{-2}\sin^2(H_\Lambda t)\left(d\chi^2 + \sin^2\chi\, d\theta^2\right). \tag{1.100} \]

Then, analytically continuing $H_\Lambda t \to iH_\Lambda t + \pi/2$, we obtain a three-dimensional de
Sitter spacetime in the form of a closed Friedmann universe:

\[ ds^2 = dt^2 - H_\Lambda^{-2}\cosh^2(H_\Lambda t)\left(d\chi^2 + \sin^2\chi\, d\theta^2\right). \tag{1.101} \]


Note that the coordinate $\chi$ varies only from $0$ to $\pi$, covering the entire space. The
same construction works for a four-dimensional closed de Sitter universe.

To obtain an open de Sitter metric we must analytically continue two coordinates
in (1.100) simultaneously: $t \to i\tilde t$ and $\chi \to i\tilde\chi$, giving

\[ ds^2 = d\tilde t^2 - H_\Lambda^{-2}\sinh^2(H_\Lambda\tilde t)\left(d\tilde\chi^2 + \sinh^2\tilde\chi\, d\theta^2\right). \tag{1.102} \]


Generalizing the procedure to four dimensions is again straightforward. 


De Sitter universe as a solution of Friedmann equations with cosmological constant
(four-dimensional case) A cosmological constant is equivalent to a “perfect fluid”
with equation of state $p_\Lambda = -\varepsilon_\Lambda$. It follows from (1.64) that

\[ d\varepsilon_\Lambda = -3(\varepsilon_\Lambda + p_\Lambda)\, d\ln a = 0, \]

and hence the energy density stays constant during expansion. Substituting
$\varepsilon_\Lambda = \mathrm{const}$ into (1.66), we obtain

\[ \ddot a - H_\Lambda^2 a = 0, \tag{1.103} \]

where

\[ H_\Lambda = \left(8\pi G\varepsilon_\Lambda/3\right)^{1/2}. \]

A general solution of this equation is

\[ a = C_1\exp(H_\Lambda t) + C_2\exp(-H_\Lambda t), \tag{1.104} \]

where $C_1$ and $C_2$ are constants of integration. These constants are constrained by
Friedmann equation (1.67):

\[ 4H_\Lambda^2 C_1 C_2 = k. \tag{1.105} \]

Hence, in a flat universe ($k = 0$), one of the constants must be equal to zero. If
$C_1 \neq 0$ and $C_2 = 0$, then (1.104) describes a flat expanding de Sitter universe and
we can choose $C_1 = H_\Lambda^{-1}$. If both $C_1$ and $C_2$ are nonzero, the time $t = 0$ can be
chosen so that $|C_1| = |C_2|$. For a closed universe ($k = +1$), we have

\[ C_1 = C_2 = \frac{1}{2H_\Lambda}, \]

while for an open universe ($k = -1$),

\[ C_1 = -C_2 = \frac{1}{2H_\Lambda}. \]
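The constraint (1.105) follows in one line once (1.67) is multiplied by $a^2$. The
derivation below is a sketch added for convenience; it uses only (1.67), (1.104) and
the definition of $H_\Lambda$:

```latex
\begin{align*}
\dot a^2 + k &= \frac{8\pi G}{3}\,\varepsilon_\Lambda a^2 = H_\Lambda^2 a^2, \\
k &= H_\Lambda^2\left[\left(C_1 e^{H_\Lambda t} + C_2 e^{-H_\Lambda t}\right)^2
   - \left(C_1 e^{H_\Lambda t} - C_2 e^{-H_\Lambda t}\right)^2\right]
   = 4H_\Lambda^2 C_1 C_2 .
\end{align*}
```

With $C_1 = C_2 = 1/(2H_\Lambda)$ this gives $k = +1$ and $a = H_\Lambda^{-1}\cosh(H_\Lambda t)$, while
$C_1 = -C_2 = 1/(2H_\Lambda)$ gives $k = -1$ and $a = H_\Lambda^{-1}\sinh(H_\Lambda t)$.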


The three solutions can be summarized as

\[ ds^2 = dt^2 - H_\Lambda^{-2}
\begin{pmatrix} \sinh^2(H_\Lambda t) \\ \exp(2H_\Lambda t) \\ \cosh^2(H_\Lambda t) \end{pmatrix}
\left[ d\chi^2 + \begin{pmatrix} \sinh^2\chi \\ \chi^2 \\ \sin^2\chi \end{pmatrix} d\Omega^2 \right],
\qquad \begin{matrix} k = -1; \\ k = 0; \\ k = +1, \end{matrix} \tag{1.106} \]

where the radial coordinate $\chi$ changes from zero to infinity in flat and open
universes. In contrast to a matter-dominated universe, where the spatial curvature is
determined by the energy density, here all three types of solutions exist for any given
value of $\varepsilon_\Lambda$. They all describe the same physical spacetime in different coordinate
systems. One should not be surprised that it is possible to cover the same spacetime
using homogeneous and isotropic hypersurfaces with different curvatures, since de
Sitter spacetime is translationally invariant in time. Any space-like hypersurface is a
constant density hypersurface.

The behavior of the scale factor $a(t)$, shown in Figure 1.11, depends on the
coordinate system. In a closed coordinate system, the scale factor first decreases,
then reaches its minimum value, and subsequently increases. In flat and open
coordinates, $a(t)$ always increases as $t$ grows but vanishes as $t \to -\infty$ and $t = 0$,
respectively. However, the vanishing of the scale factor does not represent a real
physical singularity but simply signals that the coordinates become singular. For
$t \gg H_\Lambda^{-1}$, the expansion is nearly the same in all coordinate systems, namely,
exponential with $a \propto \exp(H_\Lambda t)$.

[Fig. 1.11.]


Problem 1.19 Calculate $H(t)$ and $\Omega(t)$ in open and closed de Sitter universes.
Verify that $H(t) \to H_\Lambda$ and $\Omega(t) \to 1$ as $t \to \infty$ in both cases.


In a pure de Sitter universe, there is no real evolution. In this sense, de Sitter
spacetime is similar to Minkowski spacetime. As in the case of the Milne universe,
the apparent expansion reflects the nonstatic character of the chosen coordinate
systems. However, unlike Minkowski spacetime, there exists no static coordinate
system which can cover de Sitter spacetime on scales exceeding $H_\Lambda^{-1}$. We will see
later that only a de Sitter solution with slightly broken time-translational symmetry
plays an important role in physical applications. The notion of de Sitter expansion
is still useful in the presence of perturbations that break the exact symmetry,
and the coordinate systems (1.106) are well suited to study the behavior of these
perturbations and the subsequent exit from the de Sitter stage.


Problem 1.20 Verify that for a flat universe filled with radiation and cosmological
constant, the scale factor grows as

\[ a(t) = a_0\left(\sinh 2H_\Lambda t\right)^{1/2}, \tag{1.107} \]

where $H_\Lambda$ is defined in (1.103). Analyze and discuss the behavior of this solution
in the limits $t \to 0$ and $t \to \infty$. Derive the corresponding solutions for $k = \pm 1$.
(Hint: Use (1.71), replacing the conformal time with the physical time.)


Problem 1.21 Show that the solution of (1.67) and (1.68) for a flat universe with
cold matter (dust) and cosmological constant is

\[ a(t) = a_0\left(\sinh\frac{3}{2}H_\Lambda t\right)^{2/3}. \tag{1.108} \]

Verify that in this case the age of the universe is given by

\[ t_0 = \frac{2}{3H_0}\,\frac{1}{\sqrt{1 - \Omega_m}}\,\ln\frac{1 + \sqrt{1 - \Omega_m}}{\sqrt{\Omega_m}}, \tag{1.109} \]

where $H_0$ is the current value of the Hubble constant and $\Omega_m$ is the cold matter
contribution to the cosmological parameter today.
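The age formula (1.109) can be compared with a direct numerical integration of
$dt = da/(aH)$. The Python sketch below is illustrative only: it assumes a flat universe
with $\Omega_\Lambda = 1 - \Omega_m$ and $0 < \Omega_m < 1$, the function names are not from the text, and
the substitution $a/a_0 = s^2$ is a numerical convenience that makes the integrand smooth
at $s = 0$. It checks the formula and does not replace the derivation asked for in the
problem.

```python
import math


def age_closed_form(omega_m, H0=1.0):
    """Age of a flat dust + cosmological constant universe, equation (1.109)."""
    root = math.sqrt(1.0 - omega_m)
    return 2.0 / (3.0 * H0 * root) * math.log((1.0 + root) / math.sqrt(omega_m))


def age_numeric(omega_m, H0=1.0, steps=2000):
    """Integrate dt = da/(a H) from a = 0 to a = a0 with Simpson's rule.

    With x = a/a0 = s**2 and H^2 = H0^2 (omega_m / x^3 + 1 - omega_m), the
    integrand becomes 2 s^2 / (H0 sqrt(omega_m + (1 - omega_m) s^6)).
    The number of steps must be even.
    """
    omega_l = 1.0 - omega_m

    def integrand(s):
        return 2.0 * s**2 / (H0 * math.sqrt(omega_m + omega_l * s**6))

    h = 1.0 / steps
    total = integrand(0.0) + integrand(1.0)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * integrand(i * h)
    return total * h / 3.0


if __name__ == "__main__":
    for omega_m in (0.1, 0.3, 0.9):
        print(omega_m, age_closed_form(omega_m), age_numeric(omega_m))
```

The two columns should agree to many digits, and for $\Omega_m \to 1$ the age approaches
the pure dust value $2/(3H_0)$.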


Problem 1.22 Given a nonvanishing cosmological constant, find the static solution
for a closed universe filled with cold matter (Einstein's universe). Why is this
solution unstable?


Problem 1.23 Find the solutions for an energy component with equation of state
$p = -\varepsilon/3$ in the presence of a cosmological term. Discuss the properties of these
solutions.


2 


Propagation of light and horizons 


We obtain most of the information about the universe from light. Over the last 
century, the development of x-ray, radio and infrared detectors has given us new 
windows on the universe. Understanding the propagation of light in an expanding 
universe is therefore critical to the interpretation of observations. 


Problem 2.1 Estimate the total amount of energy received by all optical telescopes 
over the course of the last century and compare this energy to that needed to return 
this book to your bookshelf. 


There is a fundamental limit to how far we can see, since no particles can travel 
faster than light. The finite speed of light leads to “horizons” and sets an absolute 
constraint on our ability to comprehend the entire universe. The term “horizon” 
is used in different contexts in the literature, often without clear definition, and 
one of the purposes of this chapter is to carefully delineate the various usages. 
We will study in detail conformal diagrams, which are a useful pictorial way of 
representing horizons and the causal global structure of spacetime. Finally, we 
discuss the basic kinematical tests which aim to measure the distance, angular 
size, speed and acceleration of distant objects. Using these tests, one can obtain 
information about the expansion rate and deceleration parameter at earlier times, 
and thus probe the evolutionary history of the universe. 


2.1 Light geodesics 


In Special Relativity, the spacetime interval along the trajectory of a massless
particle propagating with the speed of light is equal to zero:

\[ ds^2 = 0. \tag{2.1} \]

In General Relativity, the same must be true in every local inertial coordinate frame.
Then, since the interval is invariant, the condition $ds^2 = 0$ should be valid along
the light geodesic in any curved spacetime.

We consider mainly the radial propagation of light in an isotropic universe in a
coordinate system where the observer is located at the origin. The light trajectories
look especially simple if, instead of physical time $t$, we use the conformal time

\[ \eta \equiv \int \frac{dt}{a(t)}. \]

The metric (1.47) in $\eta$, $\chi$ coordinates is

\[ ds^2 = a^2(\eta)\left(d\eta^2 - d\chi^2 - \Phi^2(\chi)\left(d\theta^2 + \sin^2\theta\, d\varphi^2\right)\right), \tag{2.2} \]

where

\[ \Phi^2(\chi) = \begin{cases} \sinh^2\chi, & k = -1; \\ \chi^2, & k = 0; \\ \sin^2\chi, & k = +1. \end{cases} \tag{2.3} \]

By symmetry, it is clear that the radial trajectory $\theta, \varphi = \mathrm{const}$ is a geodesic. The
function $\chi(\eta)$ along the trajectory is then entirely determined by the condition
$ds^2 = 0$, or

\[ d\eta^2 - d\chi^2 = 0. \tag{2.4} \]

Hence, radial light geodesics are described by

\[ \chi(\eta) = \pm\eta + \mathrm{const}, \tag{2.5} \]

and correspond to straight lines at angles $\pm 45^\circ$ in the $\eta$-$\chi$ plane.


2.2 Horizons 


Particle horizon If the universe has a finite age, then light travels only a finite
distance in that time and the volume of space from which we can receive information
at a given moment of time is limited. The boundary of this volume is called the
particle horizon. Today, the universe is roughly 15 billion years old, so a naive
estimate for the particle horizon scale is 15 billion light years.

According to (2.5), the maximum comoving distance light can propagate is

\[ \chi_p(\eta) = \eta - \eta_i = \int_{t_i}^{t}\frac{dt}{a}, \tag{2.6} \]

where $\eta_i$ (or $t_i$) corresponds to the beginning of the universe. At time $\eta$, the
information about events at $\chi > \chi_p(\eta)$ is inaccessible to an observer located at $\chi = 0$.
In a universe with an initial singularity, we can always set $\eta_i = t_i = 0$, but in some
nonsingular spacetimes, for example, the de Sitter universe, it is more convenient
to take $\eta_i \neq 0$. Multiplying $\chi_p$ by the scale factor, we obtain the physical size of
the particle horizon:

\[ d_p(t) = a(t)\chi_p = a(t)\int_{t_i}^{t}\frac{dt}{a}. \tag{2.7} \]


Until hydrogen recombination (see Section 3.6), which occurred when the universe
was 1000 times smaller than now, the universe was opaque to photons. Therefore,
in practice, our view is limited to the maximum distance light can travel since
recombination. This is called the “optical” horizon:

\[ d_{opt} = a(\eta)(\eta - \eta_r) = a(t)\int_{t_r}^{t}\frac{dt}{a}. \tag{2.8} \]


Problem 2.2 Calculate $\eta_r/\eta_0$ in a dust-dominated universe and verify that the
present optical horizon is less than the particle horizon by only a small percentage.


Although the optical horizon is not very different from the particle horizon, it 
unfortunately obscures information about the most interesting stages of the evolu- 
tion of the early universe. Primordial neutrinos and gravitational waves decouple 
from matter before photons, and so could, in principle, bring us this information. 
Sadly, the short-term prospects of detecting primordial neutrinos or cosmological 
gravitational waves are not very promising. 

Let us calculate the size of the particle horizon in flat matter-dominated and
radiation-dominated universes. Substituting $a(t) \propto t^{2/3}$ into (2.7), we find that in
a matter-dominated universe $d_p(t) = 3t$ ($c = 1$). If the universe is dominated by
radiation, then $a(t) \propto t^{1/2}$ and, correspondingly, $d_p(t) = 2t$.
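These two results are easy to confirm by integrating (2.7) numerically. The Python
sketch below is an illustration; the function name, the small but nonzero lower limit
`t_i` standing in for $t_i = 0$, and the use of Simpson's rule in the variable $\ln t$ are
all choices made here rather than anything prescribed by the text.

```python
import math


def particle_horizon(a, t, t_i=1e-30, steps=20000):
    """Physical particle horizon (2.7) for a scale factor given as a function a(t).

    The integral of dt'/a(t') from t_i to t is evaluated with Simpson's rule in
    u = ln t', which handles the integrable singularity at t' -> 0.
    The number of steps must be even.
    """
    lo, hi = math.log(t_i), math.log(t)
    h = (hi - lo) / steps

    def integrand(u):
        tp = math.exp(u)
        return tp / a(tp)  # dt' = t' du

    total = integrand(lo) + integrand(hi)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * integrand(lo + i * h)
    return a(t) * total * h / 3.0


if __name__ == "__main__":
    t = 1.0
    print(particle_horizon(lambda s: s ** (2.0 / 3.0), t) / t)  # matter: ratio d_p / t
    print(particle_horizon(lambda s: s ** 0.5, t) / t)          # radiation: ratio d_p / t
```

The overall normalization of $a(t)$ cancels in (2.7), so only the power law matters;
more generally, $a \propto t^n$ with $n < 1$ gives $d_p = t/(1 - n)$.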


Problem 2.3 Calculate the size of the particle horizon in a dust-dominated universe
with an arbitrary value of the current cosmological parameter $\Omega_0$ and show that

\[ \Phi(\chi_p) = \frac{2}{a_0 H_0 \Omega_0}, \tag{2.9} \]

where the function $\Phi$ is defined in (2.3).


Curvature scale (“Hubble horizon”) vs. particle horizon When matter satisfies the
strong energy dominance condition, $\varepsilon + 3p > 0$, the particle horizon is usually of
order the Hubble scale, $1/H$. Consequently, the terms “Hubble scale” and “particle
horizon” are sometimes used interchangeably. Some authors even conjoin the terms
and refer to a “Hubble horizon.” However, the Hubble scale, $H^{-1}$, is conceptually
distinct from a horizon. Whereas the particle horizon is a scale set by kinematical
considerations, the curvature scale is a dynamical scale that characterizes the rate
of expansion and enters the equations describing, for instance, the evolution of
cosmological perturbations. Because $H^{-1}$ is of order the 4-curvature scale, it also
characterizes the “size” of the local inertial frame.

Although the Hubble scale and particle horizon are of similar magnitudes for
some models, they can differ by a large factor when the strong energy condition
is violated, $\varepsilon + 3p < 0$. In this case, from (1.66), $\ddot a > 0$, that is, the expansion is
accelerating. Then, the integral in the expression

\[ d_p(t) = a(t)\int_{t_i}^{t}\frac{dt}{a} = a(t)\int_{a_i}^{a}\frac{da}{a\dot a} \tag{2.10} \]

converges as $t \to \infty$ and $a \to \infty$. At large $t$, the particle horizon is proportional to
$a(t)$, but the curvature scale, $H^{-1} = a/\dot a$, grows more slowly since $\dot a$ also increases
during accelerated expansion. For instance, the particle horizon in a flat de Sitter
universe, where $a(t) \propto \exp(H_\Lambda t)$, is

\[ d_p(t) = \exp(H_\Lambda t)\int_{t_i}^{t}\exp(-H_\Lambda\tilde t)\, d\tilde t = H_\Lambda^{-1}\left(\exp(H_\Lambda(t - t_i)) - 1\right). \tag{2.11} \]

For $t - t_i \gg H_\Lambda^{-1}$, the size of the causally connected region grows exponentially
fast, whereas the curvature scale, $1/H_\Lambda$, is constant. Formally, as $t_i \to -\infty$, the
particle horizon diverges, and hence all points were in causal contact. However,
this has limited significance since the flat slicing of de Sitter spacetime is geodesically
incomplete (see next section). Moreover, when applied as an approximation
for inflation, we use only a part of the whole de Sitter spacetime. The beginning
of inflation corresponds to a finite initial time $t_i$ and, consequently, the particle
horizon is finite.

Despite the fact that the curvature scale is not, if properly considered, a horizon, 
the use of the term “Hubble horizon” has become so widespread that we will 
occasionally follow the “traditional terminology.” However, the reader is strongly 
advised to keep in mind the distinction between the dynamical curvature scale and 
the kinematical horizon. 


Event horizon The event horizon is the complement of the particle horizon. The
event horizon encloses the set of points from which signals sent at a given moment
of time $\eta$ will never be received by an observer in the future. These points have
comoving coordinates

\[ \chi > \chi_e(\eta) = \int_{\eta}^{\eta_{\max}} d\eta = \eta_{\max} - \eta. \tag{2.12} \]

Hence, the physical size of the event horizon at time $t$ is

\[ d_e(t) = a(t)\int_{t}^{t_{\max}}\frac{dt}{a}, \tag{2.13} \]

where “max” refers to the final moment of time. If the universe expands forever,
then $t_{\max}$ is infinite. However, the value of $\eta_{\max}$, and hence $d_e$, can be either infinite
or finite depending on the rate of expansion. In flat and open decelerating universes,
$t_{\max}$ and $\eta_{\max}$ are both infinite, $\chi_e$ and $d_e$ diverge, and so there is no event horizon.
However, if the universe undergoes accelerated expansion, then the integral in (2.13)
converges and the radius of the event horizon is finite, even if the universe is flat or
open. In this case, $\eta$ approaches a finite limit $\eta_{\max}$ as $t_{\max} \to \infty$.
An important example is a flat de Sitter universe, where

\[ d_e(t) = \exp(H_\Lambda t)\int_{t}^{\infty}\exp(-H_\Lambda\tilde t)\, d\tilde t = H_\Lambda^{-1}, \tag{2.14} \]

that is, the size of the event horizon is equal to the curvature scale. Every event
that occurs at a given moment of time at a distance greater than $H_\Lambda^{-1}$ will never be
seen by an observer and cannot influence his future because the intervening space
is expanding too rapidly. For this reason, the situation is sometimes characterized
as “superluminal expansion.”
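The contrast between the two horizons in flat de Sitter coordinates can be made
concrete with a short numerical experiment. The Python sketch below is illustrative
only; the helper names and the finite upper cutoff used in place of $t_{\max} = \infty$ are
assumptions of this example. It evaluates the integrals in (2.11) and (2.14) with
Simpson's rule.

```python
import math


def simpson(f, lo, hi, steps=4000):
    """Composite Simpson's rule on [lo, hi]; the number of steps must be even."""
    h = (hi - lo) / steps
    total = f(lo) + f(hi)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * f(lo + i * h)
    return total * h / 3.0


def de_sitter_particle_horizon(t, t_i, H=1.0):
    """Particle horizon (2.11) for a(t) proportional to exp(H t)."""
    return math.exp(H * t) * simpson(lambda s: math.exp(-H * s), t_i, t)


def de_sitter_event_horizon(t, H=1.0, cutoff=40.0):
    """Event horizon (2.14); the upper limit is truncated at t + cutoff / H."""
    return math.exp(H * t) * simpson(lambda s: math.exp(-H * s), t, t + cutoff / H)


if __name__ == "__main__":
    H, t_i = 0.5, 0.0
    for t in (1.0, 5.0, 10.0):
        # Both columns are measured in units of the curvature scale 1/H.
        print(t, H * de_sitter_particle_horizon(t, t_i, H), H * de_sitter_event_horizon(t, H))
```

In units of $H_\Lambda^{-1}$, the first column grows as $\exp(H_\Lambda(t - t_i)) - 1$ while the
second stays equal to one at every $t$.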

In a closed decelerating universe, the time available for future observations is 
finite since the universe ultimately collapses. Therefore, there is both an event 
horizon and a particle horizon. 


Problem 2.4 Verify that, in a closed, radiation-dominated universe, the curvature
scale $H^{-1}$ is roughly equal to the particle horizon size at the beginning of expansion
but roughly coincides with the radius of the event horizon during the final stages
of collapse.


2.3 Conformal diagrams 


The homogeneous, isotropic universe is a particular case of a spherically symmetric
space. The most general form of metric respecting spherical symmetry is

\[ ds^2 = g_{ab}(x^c)\, dx^a dx^b - R^2(x^c)\, d\Omega^2, \tag{2.15} \]

where the indices $a$, $b$ and $c$ run over only two values, 0 and 1, corresponding to
the time and radial coordinates respectively. The angular part of the metric is rather
simple. It is proportional to

\[ d\Omega^2 = d\theta^2 + \sin^2\theta\, d\varphi^2 \tag{2.16} \]

and describes a 2-sphere of radius $R(x^c)$. The only nontrivial piece of the metric is
the temporal-radial part, which can describe spaces with different causal structure.
The causal structure can be represented by a two-dimensional conformal diagram,
in which every point corresponds to a 2-sphere.

The global properties of the spacetime can be completely explored by considering
the radial geodesics of light. As we showed in Section 2.1, in a coordinate system
where metric (2.15) takes the form

\[ ds^2 = a^2(\eta, \chi)\left[d\eta^2 - d\chi^2 - \Phi^2(\eta, \chi)\, d\Omega^2\right], \tag{2.17} \]

the radial propagation of light is described by the equation

\[ \chi(\eta) = \pm\eta + \mathrm{const}, \tag{2.18} \]

or in other words, by straight lines at $\pm 45$ degree angles in the $\eta$-$\chi$ plane.
In principle, it is always possible to find a coordinate system that allows us to
write (2.15) as (2.17). In the coordinate transformation

\[ x^a \to \tilde x^a = \left(\eta(x^a), \chi(x^a)\right), \]

the freedom to choose the two functions $\eta$ and $\chi$ means we can impose the two
conditions

\[ \tilde g_{01} = 0, \qquad \tilde g_{00} = -\tilde g_{11} \equiv a^2(\eta, \chi). \]

Solving the equations for $\eta$ and $\chi$ can be difficult in general, but in cosmologically
interesting cases the metric is already in the required form.

Typically, $\eta$ and $\chi$ may extend over infinite or semi-infinite intervals. Since
our goal is to visualize the causal structure of the full spacetime, in these cases
we perform a further coordinate transformation that preserves the form of metric
(2.17) but maps unbounded coordinates into coordinates which vary over a finite
interval. We shall see that it is always possible to find such transformations. In this
section, we reserve the symbols $\eta$ and $\chi$ to refer only to bounded coordinates.

A conformal diagram is a picture of a spacetime plotted in terms of $\eta$ and $\chi$.
Hence, a conformal diagram always has finite size and light geodesics (null lines)
are always represented by straight lines at $\pm 45$ degree angles. These are the defining
features of a conformal diagram. Although the finite ranges spanned by the coordinates
and the size of the diagram can be altered, its shape is uniquely determined.
Note that the diagrams of different spacetimes are exactly the same if their metrics
are related by a nonsingular conformal transformation:
$g_{\mu\nu} \to \tilde g_{\mu\nu} = \Omega^2(x^\alpha)\, g_{\mu\nu}$.

In addition to the shape of the diagram, we must pay attention to the location of
singularities. Singularities, as well as the boundaries of the diagram, are determined
by the behavior of the scale factor $a(\eta, \chi)$ and the function $\Phi(\eta, \chi)$ in (2.17). We
will see that it is possible to have two spacetimes whose conformal diagrams have
the same shape but different singular boundaries.


Closed radiation- and dust-dominated universes For a closed universe filled with
radiation or dust, the conformal diagram can be immediately drawn based on the
solutions for $a(\eta)$ found in Section 1.3.4. Metric (2.2) becomes

\[ ds^2 = a^2(\eta)\left(d\eta^2 - d\chi^2 - \sin^2\chi\, d\Omega^2\right), \tag{2.19} \]

where

\[ a = a_m\sin\eta \tag{2.20} \]

in a radiation-dominated universe and

\[ a = a_m(1 - \cos\eta) \tag{2.21} \]

in a dust-dominated universe (see (1.73) and (1.77)). In both cases, $\chi$ and $\eta$ have
finite ranges and cover the whole spacetime:

\[ \pi \ge \chi \ge 0, \qquad \pi > \eta > 0, \tag{2.22} \]

for a radiation-dominated universe and

\[ \pi \ge \chi \ge 0, \qquad 2\pi > \eta > 0, \tag{2.23} \]

for a dust-dominated universe. The conformal diagrams are a square and rectangle
respectively, and are shown in Figures 2.1 and 2.2. Horizontal and vertical lines
represent hypersurfaces of constant $\eta$ and $\chi$. The lower and upper boundaries
correspond to physical singularities where the scale factor vanishes and the energy
density and curvature diverge. In both cases, the lower half of the diagram describes
an expanding universe and the upper half corresponds to a contracting phase. The
scale factor reaches its maximum value at $\eta = \pi/2$ in the radiation-dominated
universe and at $\eta = \pi$ in the dust-dominated universe.

The essential difference between the diagrams is the comparative ranges of 7 
and x: for the dust-dominated universe n has twice the range of x, while 7 and 
x have the same range for the radiation-dominated universe. This has important 
consequences for the particle and event horizons. In both cases, we can set n; = 0 
at the lower boundary of the diagram. Then the particle horizon for the observer at 
Fig. 2.1. [Figure not reproduced. Labels: initial singularity, final singularity, particle horizon, event horizon, $\eta$ = const, $\chi$ = const.]


$\chi = 0$ is given by

$$\chi_p(\eta) = \eta - \eta_i = \eta. \tag{2.24}$$


In the radiation-dominated universe, the particle horizon spans the whole space
when $\eta \to \pi$, that is, just as the universe recollapses. At this last moment of time,
all points in space become visible. The light that reaches an observer from the
most remote point, $\chi = \pi$, reveals information about the state of the universe at the
beginning of expansion. In the dust-dominated universe, the whole universe also
becomes entirely visible at $\eta = \pi$. However, here this corresponds to the moment
of maximum expansion. There remains enough time for light to make a second trip
across the whole space before the universe recollapses.

The event horizon is given by

$$\chi_e(\eta) = \eta_{\max} - \eta. \tag{2.25}$$

In the radiation-dominated universe, there exists an event horizon for any $\eta$ since
$\eta_{\max} = \pi$. In contrast, for the dust-dominated universe, where $\eta_{\max} = 2\pi$, the event
horizon exists only during the contraction phase when $\eta > \pi$. All events that occur
at $\eta < \pi$, no matter how far away, can be seen before the universe recollapses.

In summary, as shown in Figures 2.1 and 2.2, particle and event horizons exist
at each $\eta$ in the closed radiation-dominated universe. In the matter-dominated
Fig. 2.2. [Figure not reproduced. Labels: event horizon, particle horizon, observer, $\theta, \varphi$ = const.]


universe, the particle horizon exists only during the expansion phase, and the event 
horizon exists only during the contraction phase. 
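
The summary above can be checked with a few lines of Python. This is only an illustrative sketch of (2.24) and (2.25) with $\eta_i = 0$; the function names and the dictionary `ETA_MAX` are hypothetical helpers introduced here, not notation from the text.

```python
import math

# Upper limits of conformal time from (2.22) and (2.23).
ETA_MAX = {"radiation": math.pi, "dust": 2.0 * math.pi}

def particle_horizon(eta):
    """chi_p(eta) = eta - eta_i with eta_i = 0, as in (2.24)."""
    return eta

def event_horizon(eta, model):
    """chi_e(eta) = eta_max - eta, as in (2.25)."""
    return ETA_MAX[model] - eta

def horizon_exists(chi_h):
    """A horizon is present only while it lies inside the 3-sphere, chi_h < pi."""
    return chi_h < math.pi

# Sample one moment in the expanding phase and one in the contracting phase.
for model in ("radiation", "dust"):
    for frac in (0.25, 0.75):
        eta = frac * ETA_MAX[model]
        print(model, round(eta, 3),
              "particle:", horizon_exists(particle_horizon(eta)),
              "event:", horizon_exists(event_horizon(eta, model)))
```

For the dust case the two sampled moments show a particle horizon only during expansion and an event horizon only during contraction, while the radiation case has both at each sampled $\eta$.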

The points $\chi = 0, \pi$ are the opposite poles of the three-dimensional sphere
describing the spatial geometry at any given moment of time. Light propagating
with constant $\theta$, $\varphi$ away from an observer located at $\chi = 0$ reaches the opposite
pole at $\chi = \pi$. Because the coordinate system we are using is singular at the poles,
to clarify what happens to light after passing through the pole, one has to use another
coordinate system which is regular near $\chi = \pi$.


Problem 2.5 Show that light propagating away from an observer at $\chi = 0$ in the
direction $(\theta, \varphi)$ begins to propagate back towards the observer along the direction
$(\bar\theta = \pi - \theta,\ \bar\varphi = \varphi + \pi)$ after it passes through the pole at $\chi = \pi$.


Thus, a light geodesic is “reflected” from the boundary at $\chi = \pi$ and its angular
coordinates $\theta$ and $\varphi$ change. This change of the angular coordinates is not apparent
from the conformal diagram because they are suppressed there.

Let us use a conformal diagram to infer how a galaxy located at $\chi = \chi_g =$
const appears to an observer at $\chi = 0$ in a dust-dominated universe. As is clear
from Figure 2.2, at $\eta > 2\pi - \chi_g$, when the universe is contracting, there are two
geodesics along which light emitted by the galaxy can reach the observer. Hence, the
observer simultaneously sees two images of the same galaxy in opposite directions
in the sky. One image is older than the other by $\Delta\eta = 2(\pi - \chi_g)$. In a radiation-dominated
universe, only one image of the galaxy can be seen because light does
not have enough time to travel around the pole at $\chi = \pi$ and reach an observer
before the universe recollapses.
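
The two-image argument amounts to comparing conformal path lengths. The sketch below is a hedged illustration: the function names are hypothetical, and the only inputs are the path lengths $\chi_g$ (direct) and $2\pi - \chi_g$ (through the pole at $\chi = \pi$).

```python
import math

def image_arrival_times(chi_g, eta_em):
    """Conformal arrival times at chi = 0 for light emitted at eta_em by a galaxy at chi_g."""
    direct = eta_em + chi_g                       # short way, path length chi_g
    around = eta_em + (2.0 * math.pi - chi_g)     # through the pole chi = pi
    return direct, around

def second_image_visible(chi_g, eta_max):
    """True if light emitted arbitrarily early can go around the pole before eta_max."""
    return 2.0 * math.pi - chi_g < eta_max

chi_g = 2.0
direct, around = image_arrival_times(chi_g, eta_em=0.3)
print("age difference of the images:", around - direct)   # equals 2 * (pi - chi_g)
print("dust:", second_image_visible(chi_g, 2.0 * math.pi))
print("radiation:", second_image_visible(chi_g, math.pi))
```

Because $2\pi - \chi_g > \pi$ for every $\chi_g < \pi$, the second image never appears when $\eta_{\max} = \pi$.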


Problem 2.6 Using (1.83), draw the conformal diagram for a closed universe filled 
with a mixture of dust and radiation. 


De Sitter universe De Sitter spacetime is an example of how different coordinate
systems used for the same spacetime can lead to different conformal diagrams. We
begin by rewriting metric (1.106) in terms of conformal time $\eta$ instead of physical
time $t$. For a closed universe, the relation is

$$\eta = \int_{\infty}^{t} \frac{dt}{H_\Lambda^{-1}\cosh(H_\Lambda t)} = \arcsin[\tanh(H_\Lambda t)] - \frac{\pi}{2}. \tag{2.26}$$

The conformal time $\eta$ is always negative and ranges from $-\pi$ to 0 as $t$ varies from
$-\infty$ to $+\infty$. It follows from (2.26) that

$$\cosh(H_\Lambda t) = -(\sin\eta)^{-1}, \tag{2.27}$$

which allows us to write the metric of the closed de Sitter universe as

$$ds^2 = \frac{1}{H_\Lambda^2 \sin^2\eta}\left(d\eta^2 - d\chi^2 - \sin^2\chi\, d\Omega^2\right). \tag{2.28}$$

Since the spatial coordinate $\chi$ varies from 0 to $\pi$ and the temporal coordinate $\eta$
changes from $-\pi$ to 0, the conformal diagram for a closed de Sitter universe is
Fig. 2.3. [Figure not reproduced. Labels: particle horizon, event horizon, $\eta$ = const, $\chi$ = const, $a \to \infty$.]


a square. In fact, it has the same shape as the diagram for a closed, radiation-dominated
universe, with the difference that there are no singularities at $\eta_i = -\pi$
and $\eta_{\max} = 0$ (see Figure 2.3). Moreover, in a de Sitter universe, the scale factor,
$a(\eta) = -1/(H_\Lambda \sin\eta)$, is infinite at the lower boundary of the diagram where
$\eta \to -\pi$, decreases as $\eta$ changes from $-\pi$ to $-\pi/2$, reaches its minimum value $1/H_\Lambda$,
and then grows to infinity again as $\eta \to 0$. The blowing up of the scale factor does
not signify a singularity. We have seen that all curvature invariants are constant in
de Sitter spacetime, and hence, the infinite growth of the scale factor is entirely a
coordinate effect.
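
As a sanity check on (2.26) and (2.27), one can confirm numerically that $a(\eta) = -1/(H_\Lambda \sin\eta)$ reproduces $H_\Lambda^{-1}\cosh(H_\Lambda t)$. The sketch below assumes units with $H_\Lambda = 1$ by default; the function names are illustrative only.

```python
import math

def eta_closed(t, H=1.0):
    """Conformal time of the closed de Sitter universe, eq. (2.26)."""
    return math.asin(math.tanh(H * t)) - math.pi / 2.0

def scale_factor(eta, H=1.0):
    """a(eta) = -1 / (H sin eta), the conformal factor appearing in (2.28)."""
    return -1.0 / (H * math.sin(eta))

for t in (-3.0, -1.0, 0.0, 2.0):
    eta = eta_closed(t)
    # The two columns should agree: a(eta(t)) versus cosh(H t) / H with H = 1.
    print(f"t = {t:+.1f}  eta = {eta:+.4f}  a(eta) = {scale_factor(eta):.6f}  cosh(t) = {math.cosh(t):.6f}")
```

The minimum of the scale factor, $1/H_\Lambda$, occurs at $t = 0$, which corresponds to $\eta = -\pi/2$.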

As with the closed radiation-dominated universe, de Sitter spacetime has both a
particle horizon,

$$\chi_p(\eta) = \eta - \eta_i = \eta + \pi, \tag{2.29}$$

and an event horizon,

$$\chi_e(\eta) = \eta_{\max} - \eta = -\eta, \tag{2.30}$$

which exist at any time $\eta$. In both the closed de Sitter and radiation-dominated
universes, the physical size of the event horizon $d_e(t)$ approaches the curvature
scale $H^{-1}$ near the upper boundary of the conformal diagram. However, in de Sitter
spacetime, $H$, and consequently the size of the event horizon, remain constant; in a
radiation-dominated universe, $H$ increases and the size of the event horizon shrinks.
Fig. 2.4. [Figure not reproduced. Axes: $\eta$, $\chi$. Labels: $\tilde t$ = const, $\tilde t = -\infty$, $r = H_\Lambda^{-1}$, $r < H_\Lambda^{-1}$.]





Problem 2.7 One can utilize for de Sitter spacetime the so-called “static coordinates”
$\tilde t$, $r$, related to $\eta$, $\chi$ via

$$\tanh(H_\Lambda \tilde t) = \frac{\cos\eta}{\cos\chi}, \qquad H_\Lambda r = \frac{\sin\chi}{\sin\eta}. \tag{2.31}$$

Verify that in these coordinates the metric takes the following form:

$$ds^2 = \left[1 - (H_\Lambda r)^2\right] d\tilde t^2 - \frac{dr^2}{1 - (H_\Lambda r)^2} - r^2 d\Omega^2. \tag{2.32}$$

The hypersurfaces of constant $r$ and $\tilde t$ are shown in Figure 2.4. De Sitter horizons
correspond to $r = H_\Lambda^{-1}$ and $\tilde t = \pm\infty$. The static coordinates cover only half of de
Sitter spacetime: regions I and III in Figure 2.4. They are singular on the horizons
but can be continued beyond. For $r > H_\Lambda^{-1}$, the radial coordinate $r$ plays the role
of time and $\tilde t$ becomes a space-like coordinate. Introduce the proper time

$$d\tau = \frac{dr}{\left[(H_\Lambda r)^2 - 1\right]^{1/2}} \tag{2.33}$$

and verify that in regions II and IV the “static” metric (2.32) describes contracting
and expanding space respectively. We conclude that there exists no static coordinate
system covering de Sitter spacetime on scales exceeding the curvature scale. Note
that the trajectory $r$ = const is a geodesic only if $r = 0$.
In a flat de Sitter universe, the scale factor grows as $a(\tilde t) = H_\Lambda^{-1}\exp(H_\Lambda \tilde t)$,
where the physical time $\tilde t$ is related to the conformal time $\tilde\eta$ via

$$\exp(H_\Lambda \tilde t) = -1/\tilde\eta. \tag{2.34}$$

Hence, in conformal coordinates the metric becomes

$$ds^2 = \frac{1}{H_\Lambda^2 \tilde\eta^2}\left(d\tilde\eta^2 - d\tilde\chi^2 - \tilde\chi^2 d\Omega^2\right), \tag{2.35}$$

where $0 > \tilde\eta > -\infty$ and $+\infty > \tilde\chi > 0$. Unlike the case of a closed de Sitter universe,
here $\tilde\eta$, $\tilde\chi$ have infinite ranges and to draw the conformal diagram, we must
first transform to coordinates which range over finite intervals. Fortunately, there
is a natural choice for such coordinates: we simply use the $\eta$, $\chi$ coordinates of the
closed de Sitter universe. The relation between $\tilde\eta$, $\tilde\chi$ and $\eta$, $\chi$ coordinates immediately
follows from (1.99) if we express $t$ and $\tilde t$ in terms of $\eta$ and $\tilde\eta$ respectively. The
result is

$$\tilde\eta = \frac{\sin\eta}{\cos\eta + \cos\chi}, \qquad \tilde\chi = \frac{\sin\chi}{\cos\eta + \cos\chi}. \tag{2.36}$$


Using these relations, one can draw the hypersurfaces of constant $\tilde\eta$ and $\tilde\chi$ (the
coordinates in (2.35)) in the $(\eta, \chi)$ plane, as shown in Figure 2.5. We find that when
$\tilde\eta$, $\tilde\chi$ coordinates run over their semi-infinite ranges, they cover only half of de Sitter
spacetime, a triangle whose lower boundary coincides with the particle horizon. On
the particle horizon, $\tilde\eta \to -\infty$, $\tilde\chi \to +\infty$, and hence, the flat coordinates become
singular.
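
A short numerical sketch can make the map (2.36) concrete. It relies on the identity $\tilde\eta \pm \tilde\chi = \tan[(\eta \pm \chi)/2]$, which follows from (2.36) by the sum-to-product formulas and is stated here as a derived check rather than as a formula from the text; the function names are hypothetical.

```python
import math

def flat_from_closed(eta, chi):
    """(eta_tilde, chi_tilde) of flat de Sitter from closed (eta, chi), eq. (2.36)."""
    d = math.cos(eta) + math.cos(chi)
    return math.sin(eta) / d, math.sin(chi) / d

def in_flat_patch(eta, chi):
    """True if the point maps into the allowed ranges eta_tilde < 0, chi_tilde > 0."""
    eta_t, chi_t = flat_from_closed(eta, chi)
    return eta_t < 0.0 and chi_t > 0.0

eta, chi = -1.0, 0.5                      # a point above the particle horizon
eta_t, chi_t = flat_from_closed(eta, chi)
# Null combinations: eta_tilde +/- chi_tilde = tan((eta +/- chi) / 2).
print(eta_t + chi_t, math.tan((eta + chi) / 2.0))
print(eta_t - chi_t, math.tan((eta - chi) / 2.0))
print(in_flat_patch(-1.0, 0.5), in_flat_patch(-2.5, 2.0))
```

Points with $\eta > \chi - \pi$ land in the flat patch, while points below the particle horizon $\chi_p = \eta + \pi$ do not.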


Problem 2.8 To understand the shape of the constant $\tilde\eta$ and constant $\tilde\chi$ hypersurfaces
near the corners of the triangular conformal diagram, $\chi = 0$, $\eta = -\pi$ and
$\chi = \pi$, $\eta = 0$, calculate the derivatives $d\eta/d\chi$ along these hypersurfaces.


Viewing the flat de Sitter solution as describing an infinite space, we can categorize
the types of infinities that arise. For instance, space-like infinity, where
$\tilde\chi \to +\infty$ along a hypersurface of constant $\tilde\eta$, is represented on the conformal
diagram by a point which is denoted as $i^0$. The past time-like infinity, from where
all time-like lines emanate, occurs at $\tilde\eta \to -\infty$ for finite $\tilde\chi$ and is denoted by $i^-$.
The lower diagonal boundary of the flat de Sitter diagram corresponds to the region
from which incoming light-like geodesics originate. It is easy to verify that as we
approach this boundary, $\tilde\chi \to \infty$ and $\tilde\eta \to -\infty$ but the sum $\tilde\chi + \tilde\eta$ remains finite.
This infinity is called past null infinity and denoted by $\mathcal{J}^-$.

In an open de Sitter universe, the relation between physical and conformal times
is

$$\sinh(H_\Lambda \tilde t) = -1/\sinh\tilde\eta \tag{2.37}$$

and the metric becomes

$$ds^2 = \frac{1}{H_\Lambda^2 \sinh^2\tilde\eta}\left(d\tilde\eta^2 - d\tilde\chi^2 - \sinh^2\tilde\chi\, d\Omega^2\right). \tag{2.38}$$

The coordinates run over the same range as in a flat de Sitter universe, $0 > \tilde\eta > -\infty$
and $+\infty > \tilde\chi > 0$, therefore the conformal diagrams of these two spaces will look
similar. We can again use the closed coordinates to determine which part of the
de Sitter spacetime is covered by the open coordinates. The corresponding relation
between coordinate systems follows from (1.99):

$$\tanh\tilde\eta = \frac{\sin\eta}{\cos\chi}, \qquad \tanh\tilde\chi = \frac{\sin\chi}{\cos\eta}. \tag{2.39}$$

In this case, the coordinates $\tilde\eta$, $\tilde\chi$ cover only one eighth of the whole de Sitter
spacetime (Figure 2.6), and thus, cover an even smaller part of the de Sitter manifold
than the flat coordinates. Of course, it only makes sense to compare the sizes of
different diagrams when they describe the same spacetime, as in the case of the de
Sitter manifold. Otherwise, as noted before, the size of the diagram has no invariant
meaning.
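
The fractions "one half" and "one eighth" can be estimated by brute force. The following sketch samples the closed de Sitter square $-\pi < \eta < 0$, $0 < \chi < \pi$ on a midpoint grid and counts the points that map into the allowed ranges of the flat coordinates (2.36) and the open coordinates (2.39). The grid size and the function name are arbitrary choices made for illustration.

```python
import math

def covered_fractions(n=400):
    """Fractions of the closed de Sitter square covered by flat and open coordinates."""
    flat = open_ = 0
    step = math.pi / n
    for i in range(n):
        chi = (i + 0.5) * step
        for j in range(n):
            eta = -math.pi + (j + 0.5) * step
            # (2.36): sin(eta) < 0, so eta_tilde < 0 requires cos(eta) + cos(chi) > 0.
            if math.cos(eta) + math.cos(chi) > 0.0:
                flat += 1
            # (2.39): need -1 < tanh(eta_tilde) < 0 and 0 < tanh(chi_tilde) < 1.
            if math.cos(chi) > -math.sin(eta) and math.cos(eta) > math.sin(chi):
                open_ += 1
    return flat / n**2, open_ / n**2

flat_frac, open_frac = covered_fractions()
print(f"flat: {flat_frac:.3f}, open: {open_frac:.3f}")
```

With this grid the two numbers should come out close to 1/2 and 1/8, in line with the triangles shown in Figures 2.5 and 2.6.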


Problem 2.9 Calculate the derivative $d\eta/d\chi$ along the hypersurfaces $\tilde\eta$ = const
and $\tilde\chi$ = const, near $i^-$ and $i^0$ respectively.
Fig. 2.6. [Figure not reproduced. Labels: $\tilde\chi$ = const, $\tilde\eta$ = const.]


The conformal diagrams show explicitly that flat and open de Sitter universes
are geodesically incomplete. For instance, following into its past the geodesic of a
photon which arrives at $\chi = 0$, we find that this geodesic leaves first the open and
then the flat de Sitter region.

Finally, we note that the hypersurfaces of constant time in all coordinate systems
become increasingly flat and similar for $\chi < \pi/2$ as $\eta \to 0^-$. In this limit, the
scale factor is inversely proportional to conformal time or, equivalently, increases
exponentially with physical time.

The reader may naturally wonder why we need to study the same de Sitter 
spacetime in three different coordinate systems. As mentioned previously, the de 
Sitter spacetime is useful in a practical sense because it can be viewed as the leading 
order approximation to a universe undergoing inflationary expansion. In realistic 
inflationary models, time-translational invariance is broken and the energy density 
varies slightly with time. The hypersurface along which inflation ends is usually the 
hypersurface of constant energy density and the geometry of the future Friedmann 
universe depends on its shape. It can, in principle, be the surface of constant time 
in closed, flat or open de Sitter coordinates and, as a result of a graceful exit from 
inflation, one obtains a closed, flat or open Friedmann universe respectively. 

The full cosmic history can be represented by gluing together the pieces of 
conformal diagrams describing different phases of the universe’s evolution. When 
gluing these pieces, however, one should not forget that every point of the diagram 
corresponds to a 2-sphere and that the 3-geometries of hypersurfaces along which
the diagrams are glued must match. 

To complete the set of the diagrams needed in cosmology, we must also con- 
struct the conformal diagrams describing open and flat universes filled by matter 
and radiation. As a preliminary step, we first consider the conformal diagram for 
Minkowski spacetime, which turns out to be useful in drawing the diagrams of 
more complicated infinite spaces. 


Minkowski spacetime In spherical coordinates the Minkowski metric takes the form

$$ds^2 = dt^2 - dr^2 - r^2 d\Omega^2. \tag{2.40}$$

It is trivially conformal but the time and radial coordinates range over infinite
intervals, $+\infty > t > -\infty$ and $+\infty > r > 0$, and, therefore, have to be replaced
by coordinates with finite ranges. There exist many such coordinate systems for
Minkowski spacetime. One choice is to introduce $\eta$ and $\chi$ coordinates which are
related to the $t$ and $r$ coordinates in the same way as closed and open de Sitter
coordinates are related (see (2.39)), namely,

$$\tanh t = \frac{\sin\eta}{\cos\chi}, \qquad \tanh r = \frac{\sin\chi}{\cos\eta}. \tag{2.41}$$

The Minkowski metric in the new coordinates then becomes

$$ds^2 = \frac{1}{\cos^2\chi - \sin^2\eta}\left(d\eta^2 - d\chi^2 - \Psi^2(\eta, \chi)\, d\Omega^2\right), \tag{2.42}$$


where $\Psi$ can be calculated but is not important for our purposes. Comparing the
Minkowski time $t$ to $\tilde\eta$ in an open de Sitter universe (2.39), we see that $t$ runs from
$-\infty$ to $+\infty$, while $\tilde\eta$ is restricted to negative values (because the scale factor in
open de Sitter spacetime blows up as $\tilde\eta \to 0$). Therefore, in the $(\eta, \chi)$ plane, the
hypersurfaces of constant $t$ and $r$ span a large triangle, which can be thought of
as made from two smaller triangles describing the open de Sitter spacetime and its
time-reversed copy (Figure 2.7). Minkowski spacetime possesses two additional
types of infinities compared to an open de Sitter universe: a future time-like infinity
$i^+$, where all time-like lines end ($t \to +\infty$, $r$ is finite), and a future null infinity
$\mathcal{J}^+$ ($t \to +\infty$, $r \to +\infty$ with $t - r$ finite), the region towards which outgoing
radial light geodesics extend. Region I in Figure 2.7 corresponds to a future light
cone which can also be covered by Milne coordinates. The Milne conformal diagram
is geometrically similar to the Minkowski one, though it is four times smaller.
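
The conformal factor in (2.42) can be checked in a few lines. What follows is a sketch of one way to do it, using null combinations $t \pm r$ and the addition theorem for tanh; it uses only (2.40) and (2.41) and is not taken from the text.

```latex
% Sketch: conformal factor of the Minkowski metric in the (\eta, \chi) coordinates of (2.41).
\begin{align*}
\tanh(t \pm r) &= \frac{\tanh t \pm \tanh r}{1 \pm \tanh t \tanh r}
  = \frac{\sin\eta\cos\eta \pm \sin\chi\cos\chi}{\cos\chi\cos\eta \pm \sin\eta\sin\chi}
  = \sin(\eta \pm \chi), \\
d(t \pm r) &= \frac{d(\eta \pm \chi)}{\cos(\eta \pm \chi)}, \\
dt^2 - dr^2 &= d(t+r)\, d(t-r)
  = \frac{d\eta^2 - d\chi^2}{\cos(\eta+\chi)\cos(\eta-\chi)}
  = \frac{d\eta^2 - d\chi^2}{\cos^2\chi - \sin^2\eta}.
\end{align*}
```

The last step uses $\cos(\eta+\chi)\cos(\eta-\chi) = \cos^2\eta\cos^2\chi - \sin^2\eta\sin^2\chi = \cos^2\chi - \sin^2\eta$.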


Problem 2.10 Draw the conformal diagram for the Milne universe and verify this 
last statement. 
Open and flat universes Now we will use the Minkowski conformal diagram to
construct the diagram for open and flat universes dominated by matter satisfying
the strong energy dominance condition, $\varepsilon + 3p > 0$. The metric is

$$ds^2 = a^2(\tilde\eta)\left(d\tilde\eta^2 - d\tilde\chi^2 - \Phi^2(\tilde\chi)\, d\Omega^2\right), \tag{2.43}$$

where the scale factor $a$ vanishes at a singularity occurring at $\tilde\eta = 0$. (Here we have
added tildes to the notation in (2.2) since $\eta$, $\chi$ are reserved for coordinates with
finite ranges.) The conformal time $\tilde\eta$ is confined to the range $(0, +\infty)$. Since $\Phi(\tilde\chi)$
is equal to $\tilde\chi$ for a flat universe and to $\sinh\tilde\chi$ for an open universe, in both cases
$\tilde\chi$ changes from 0 to $+\infty$. For $\tilde\eta > 0$, the temporal-radial part of metric (2.43) is
related to the Minkowski metric (2.40) by a nonsingular conformal transformation.
The coordinates $t$ and $r$ considered in the upper half of Minkowski spacetime ($t > 0$)
span the same range as the $\tilde\eta$, $\tilde\chi$ coordinates. Hence, the conformal diagrams of open
and flat universes should have the same shape as the upper half of the Minkowski
conformal diagram (Figure 2.8). The hypersurfaces of constant $\tilde\eta$ and constant $\tilde\chi$
can then be drawn in the $(\eta, \chi)$ plane, where $\eta$, $\chi$ are related to $\tilde\eta$, $\tilde\chi$ as in (2.41) with
the substitutions $t \to \tilde\eta$ and $r \to \tilde\chi$. In open and flat universes, the lower boundary
($\tilde\eta = 0$) corresponds to a physical singularity.
Problem 2.11 Draw the conformal diagram for open and flat universes where
the scale factor changes as $a(t) \propto t^p$, $p > 1$. This is the situation for power-law
inflation. Note that the strong energy condition is violated in this case. Indicate the
particle and event horizons and the types of infinities. Draw the conformal diagram
for a flat universe filled by matter with equation of state $p = -\varepsilon/3$. Compare this
case with the Milne universe.


Problem 2.12 The metric of an eternal black hole in the Kruskal-Szekeres coordinate
system takes the form

$$ds^2 = a^2(v, u)\left(dv^2 - du^2 - \Psi^2(v, u)\, d\Omega^2\right). \tag{2.44}$$

The only extra information we need to draw the conformal diagram is that the space-like
coordinate $u$ ranges from $-\infty$ to $+\infty$ and that there is a physical singularity
located at

$$v^2 - u^2 = 1.$$

The existence of a singularity means that, for every $u$, the spacetime cannot be
extended outside the interval

$$-\sqrt{1 + u^2} < v < +\sqrt{1 + u^2}.$$

Draw the conformal diagram for the eternal black hole and identify the types of
infinities. The Schwarzschild radius of the black hole is located at $v^2 = u^2$.


2.4 Redshift 


The expansion of the universe leads to a redshift of the photon wavelength. To
analyze this effect, let us consider a source of radiation with comoving coordinate
$\chi_{em}$, which at time $\eta_{em}$ emits a signal of short conformal duration $\Delta\eta$ (Figure 2.9).
According to (2.5), the trajectory of the signal is

$$\chi(\eta) = \chi_{em} - (\eta - \eta_{em})$$

and it reaches a detector located at $\chi_{obs} = 0$ at time $\eta_{obs} = \eta_{em} + \chi_{em}$. The conformal
duration of the signal measured by the detector is the same as at the source,
but the physical time intervals are different at the points of emission and detection.
They are equal to

$$\Delta t_{em} = a(\eta_{em})\Delta\eta \quad \text{and} \quad \Delta t_{obs} = a(\eta_{obs})\Delta\eta$$


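respectively. If $\Delta t$ is the period of the light wave, the light is emitted with wavelength
$\lambda_{em} = \Delta t_{em}$ but is observed with wavelength $\lambda_{obs} = \Delta t_{obs}$, so that

$$\frac{\lambda_{obs}}{\lambda_{em}} = \frac{a(\eta_{obs})}{a(\eta_{em})}. \tag{2.45}$$

Fig. 2.9. [Figure not reproduced. Axes: $\eta$, $\chi$. Labels: $\eta_{em}$, $\eta_{obs}$, $\eta_{obs} + \Delta\eta$, $\Delta\eta$, $\chi_{em}$.]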
Thus, the wavelength of the photon changes in proportion to the scale factor, $\lambda(t) \propto
a(t)$, and its frequency, $\omega \propto 1/\lambda$, decreases as $1/a$.

The Planck distribution, characterizing blackbody radiation, has the important
property that it preserves its shape as the universe expands. However, because each
photon is redshifted, $\omega \to \omega/a$, the temperature $T$ scales as $1/a$. Therefore, the
energy density of radiation, which is proportional to $T^4$, decreases as the fourth
power of the scale factor, in complete agreement with what we obtained earlier for
an ultra-relativistic gas with equation of state $p = \varepsilon/3$. The number density of the
photons is proportional to $T^3$, and therefore decays as the third power of the scale
factor so that the total number of photons is conserved.


Redshift as Doppler shift The cosmological redshift can be interpreted as a Doppler
shift associated with the relative motion of galaxies due to Hubble expansion. If
we begin with two neighboring galaxies separated by distance $\Delta l \ll H^{-1}$, then
there exists a local inertial frame in which spacetime can be considered flat. According
to the Hubble law, the relative recessional speed of the two galaxies is
$v = H(t)\Delta l \ll 1$. Because of this, the frequency of a photon, $\omega(t_1)$, measured
by an observer at galaxy “1” at the moment $t_1$, will be larger than the frequency
of the same photon, $\omega(t_2)$, measured at $t_2 > t_1$ by an observer at galaxy “2”, by a
Doppler factor (Figure 2.10):

$$\Delta\omega \equiv \omega(t_1) - \omega(t_2) \simeq \omega(t_1)\, v = \omega(t_1) H(t) \Delta l. \tag{2.46}$$

The time delay between measurements is $\Delta t = t_2 - t_1 = \Delta l$ and so we can rewrite
(2.46) as a differential equation:

$$\dot\omega = -H(t)\,\omega. \tag{2.47}$$

This has the solution

$$\omega \propto 1/a. \tag{2.48}$$
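
To see (2.47) produce (2.48) in a concrete case, one can integrate the equation numerically. The sketch below assumes, purely for illustration, a flat dust-dominated background with $a(t) = t^{2/3}$, so that $H = 2/(3t)$; the Runge-Kutta integrator and the function names are choices made here and are not part of the text.

```python
def hubble(t):
    """H(t) = (da/dt)/a for the illustrative choice a(t) = t**(2/3)."""
    return 2.0 / (3.0 * t)

def evolve_frequency(omega0, t0, t1, steps=10000):
    """Integrate d(omega)/dt = -H(t) * omega, eq. (2.47), with classical RK4."""
    h = (t1 - t0) / steps
    t, w = t0, omega0
    rhs = lambda t, w: -hubble(t) * w
    for _ in range(steps):
        k1 = rhs(t, w)
        k2 = rhs(t + h / 2, w + h * k1 / 2)
        k3 = rhs(t + h / 2, w + h * k2 / 2)
        k4 = rhs(t + h, w + h * k3)
        w += h * (k1 + 2 * k2 + 2 * k3 + k4) / 6
        t += h
    return w

omega = evolve_frequency(1.0, t0=1.0, t1=8.0)
scale_ratio = (8.0 / 1.0) ** (2.0 / 3.0)     # a(t1) / a(t0)
print(omega * scale_ratio)                   # stays close to the initial value 1.0
```

The product $\omega a$ is conserved to the accuracy of the integrator, which is the content of (2.48).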


Fig. 2.10. [Figure not reproduced. Labels: $\omega(t_1)$, $\omega(t_2)$.]
Although the derivation above has been performed in a local inertial frame, it can
be applied piecewise to a general geodesic photon trajectory. The result is therefore 
valid in curved spacetime as well. However, the interpretation of the redshift as a 
Doppler shift is not applicable for distances larger than the curvature scale. In this 
limit, as we have pointed out, distance and relative velocity do not have an invariant 
meaning, so the notion of Doppler shift becomes ill defined. 


Redshift of peculiar velocities The peculiar velocities of massive particles (velocities
with respect to the Hubble flow) are also redshifted as the universe expands.
The peculiar velocity of a particle, $w(t_1)$, measured by observer “1” at time $t_1$,
is different from the peculiar velocity of the same particle, $w(t_2)$, measured by
observer “2”, by the relative Hubble speed of the observers: $v = H(t)\Delta l$. Hence,

$$w(t_1) - w(t_2) \simeq v = H(t)\Delta l. \tag{2.49}$$

Given that the particle needs time $\Delta t = t_2 - t_1 = \Delta l/w$ to make the journey between
the two observers, we can rewrite this equation as

$$\dot w = -H(t)\, w. \tag{2.50}$$

Once again we have the solution

$$w \propto 1/a. \tag{2.51}$$

Thus, the expansion of the universe eventually brings particles to rest in the comoving
frame.

The temperature of a nonrelativistic gas of particles is proportional to the peculiar
velocity squared,

$$T_{gas} \propto w^2 \propto 1/a^2, \tag{2.52}$$

and therefore, if the gas and radiation are decoupled, gas will cool faster than
radiation.
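
The difference between the two cooling laws is easy to quantify. The sketch below assumes that gas and radiation share a common temperature `T_dec` at scale factor `a_dec` and evolve independently afterwards; both names are hypothetical and introduced only for this illustration.

```python
def temperatures_after_decoupling(T_dec, a_dec, a):
    """Return (T_radiation, T_gas) at scale factor a, using T_rad ~ 1/a and (2.52)."""
    ratio = a_dec / a
    return T_dec * ratio, T_dec * ratio**2

# After the universe has grown by a factor of 10, radiation has cooled by 10, gas by 100.
T_rad, T_gas = temperatures_after_decoupling(T_dec=100.0, a_dec=1.0, a=10.0)
print(T_rad, T_gas)
```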

For the same reasons as in the case of radiation, the above derivation for peculiar 
velocities is rigorous and applicable in curved spacetime. This can also be verified 
directly by solving the geodesic equations for the particles. 


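Problem 2.13 Show that the geodesic equation

$$\frac{du^\alpha}{ds} + \Gamma^\alpha_{\beta\gamma} u^\beta u^\gamma = 0 \tag{2.53}$$

can be rewritten as

$$\frac{du_\alpha}{ds} - \frac{1}{2}\frac{\partial g_{\beta\gamma}}{\partial x^\alpha} u^\beta u^\gamma = 0. \tag{2.54}$$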
One can always go to a coordinate system in which only the radial peculiar
velocity of the particle, $u^\chi$, is different from zero. Taking into account that in an
isotropic, homogeneous universe the metric components $g_{\eta\eta}$ and $g_{\chi\chi}$ do not depend
on $\chi$, we infer from (2.54) that $u_\chi$ = const. Hence, the peculiar velocity,

$$w = a u^\chi = a g^{\chi\chi} u_\chi \propto a^{-1}, \tag{2.55}$$

decays in inverse proportion to the scale factor.


2.4.1 Redshift as a measure of time and distance

The redshift parameter is defined as the fractional shift in wavelength of a photon
emitted by a distant galaxy at time $t_{em}$ and observed on Earth today:

$$z = \frac{\lambda_{obs} - \lambda_{em}}{\lambda_{em}}. \tag{2.56}$$

According to (2.45), the ratio $\lambda_{obs}/\lambda_{em}$ is equal to the ratio of the scale factors at
the corresponding moments of time, and hence

$$1 + z = \frac{a_0}{a(t_{em})}, \tag{2.57}$$

where $a_0$ is the present value of the scale factor.

The light detected today was emitted at some earlier time $t_{em}$ and, according
to (2.57), there is a one-to-one correspondence between $z$ and $t_{em}$. Therefore, the
redshift $z$ can be used instead of time $t$ to parameterize the history of the universe.
A given $z$ corresponds to a time when our universe was $1 + z$ times smaller than
now. We can express all time-dependent quantities as functions of $z$. For example,
the formula for the energy density $\varepsilon(z)$ follows immediately from the energy
conservation equation $d\varepsilon = -3(\varepsilon + p)\, d\ln a$:

$$\int_{\varepsilon_0}^{\varepsilon(z)} \frac{d\varepsilon}{\varepsilon + p(\varepsilon)} = 3\ln(1 + z). \tag{2.58}$$


To obtain the expression for the Hubble parameter $H$ in terms of $z$ and the present
values of $H_0$ and $\Omega_0$, it is convenient to rewrite the Friedmann equation (1.67) in
the form

$$H^2(z) + \frac{k}{a_0^2}(1 + z)^2 = \Omega_0 H_0^2\, \frac{\varepsilon(z)}{\varepsilon_0}, \tag{2.59}$$
where the definitions in (1.21) and (2.57) have been used. At $z = 0$, this equation
reduces to

$$\frac{k}{a_0^2} = (\Omega_0 - 1) H_0^2, \tag{2.60}$$

allowing us to express the current value of the scale factor $a_0$ in a spatially curved
universe ($k \neq 0$) in terms of $H_0$ and $\Omega_0$. Taking this into account, we obtain

$$H(z) = H_0 \left( (1 - \Omega_0)(1 + z)^2 + \Omega_0\, \frac{\varepsilon(z)}{\varepsilon_0} \right)^{1/2}. \tag{2.61}$$


Generically, the expressions for $a(t)$ are rather complicated and one cannot directly
invert (2.57) to express the cosmic time $t \equiv t_{em}$ in terms of the redshift
parameter $z$. It is useful, therefore, to derive a general integral expression for $t(z)$.
Differentiating (2.57), we obtain

$$dz = -\frac{a_0}{a^2(t)}\, \dot a\, dt = -(1 + z) H(t)\, dt, \tag{2.62}$$

from which it follows that

$$t = \int_z^\infty \frac{dz}{H(z)(1 + z)}. \tag{2.63}$$

A constant of integration has been chosen here so that $z \to \infty$ corresponds to the
initial moment of time, $t = 0$. Thus, to determine $t(z)$, one should first find $\varepsilon(z)$
and, after substituting (2.61) into (2.63), perform the integration.

Knowing the redshift of light from a distant galaxy we can unambiguously
determine its separation from us; that is, redshift can also be used as a measure
of distance. The comoving distance to a galaxy that emitted a photon at time $t_{em}$
which arrives today is

$$\chi = \eta_0 - \eta_{em} = \int_{t_{em}}^{t_0} \frac{dt}{a(t)}. \tag{2.64}$$

Substituting $a(t) = a_0/(1 + z)$ and the expression for $dt$ in terms of $dz$ from (2.62),
we obtain

$$\chi(z) = \frac{1}{a_0} \int_0^z \frac{dz}{H(z)}. \tag{2.65}$$

In a universe with nonzero spatial curvature ($k \neq 0$), the current value of the scale
factor $a_0$ can be expressed in terms of $H_0$ and $\Omega_0$ via (2.60):

$$a_0^{-1} = \sqrt{|\Omega_0 - 1|}\, H_0.$$
Note that as $z \to \infty$, $\chi(z)$ approaches the particle horizon. Hence, the redshift
parameter measures distance only within the particle horizon.

Finally, let us find the explicit expressions for $t(z)$ and $\chi(z)$ in a dust-dominated
universe. In this case, $\varepsilon(z) = \varepsilon_0 (1 + z)^3$ and

$$H(z) = H_0 (1 + z) \sqrt{1 + \Omega_0 z}.$$


For a flat universe ($\Omega_0 = 1$), the integrals in (2.63) and (2.65) are straightforward
and we find

$$t(z) = \frac{2}{3 H_0 (1 + z)^{3/2}}, \qquad \chi(z) = \frac{2}{a_0 H_0}\left(1 - \frac{1}{\sqrt{1 + z}}\right). \tag{2.66}$$

Problem 2.14 Verify that in both open and closed dust-dominated universes

$$\Phi(\chi(z)) = \frac{2\sqrt{|\Omega_0 - 1|}}{\Omega_0^2 (1 + z)} \left[\Omega_0 z + (\Omega_0 - 2)\left(\sqrt{1 + \Omega_0 z} - 1\right)\right], \tag{2.67}$$

where the function $\Phi$ is defined in (2.3). Note that if $\Omega_0 z \gg 1$, then $\Phi(\chi(z)) \to
\Phi(\chi_p)$ given in (2.9). Derive the explicit expressions for $t(z)$.
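
The integrals (2.63) and (2.65) are simple to evaluate numerically, which gives an independent spot check of (2.66) and (2.67). The sketch below assumes a dust-dominated universe, uses a plain midpoint rule, and works in units with $H_0 = 1$ by default; all function names and the step count `n` are illustrative choices, not part of the text.

```python
import math

def hubble_dust(z, omega0, H0=1.0):
    """H(z) from (2.61) with eps(z)/eps0 = (1 + z)**3 (dust)."""
    return H0 * math.sqrt((1.0 - omega0) * (1.0 + z)**2 + omega0 * (1.0 + z)**3)

def age_at_redshift(z, omega0, H0=1.0, n=20000):
    """t(z) from (2.63); the substitution x = 1/(1 + z') maps the range to (0, 1/(1+z))."""
    h = (1.0 / (1.0 + z)) / n
    total = 0.0
    for i in range(n):
        x = (i + 0.5) * h
        total += h / (x * hubble_dust(1.0 / x - 1.0, omega0, H0))
    return total

def comoving_distance(z, omega0, H0=1.0, a0=1.0, n=20000):
    """chi(z) from (2.65), midpoint rule in z'."""
    h = z / n
    return sum(h / hubble_dust((i + 0.5) * h, omega0, H0) for i in range(n)) / a0

def phi_dust(z, omega0):
    """Right-hand side of (2.67), for omega0 != 1."""
    root = math.sqrt(1.0 + omega0 * z)
    return (2.0 * math.sqrt(abs(omega0 - 1.0)) / (omega0**2 * (1.0 + z))
            * (omega0 * z + (omega0 - 2.0) * (root - 1.0)))

# Flat case: compare with (2.66), taking a0 = 1.
print(age_at_redshift(3.0, 1.0), 2.0 / (3.0 * 4.0**1.5))
print(comoving_distance(3.0, 1.0), 2.0 * (1.0 - 1.0 / math.sqrt(4.0)))

# Closed case omega0 = 2: a0 = 1 / (sqrt(|omega0 - 1|) * H0) = 1, and Phi = sin(chi).
print(math.sin(comoving_distance(1.0, 2.0, a0=1.0)), phi_dust(1.0, 2.0))
```

In each printed pair the numerical integral and the closed-form expression should agree to several decimal places. For an open universe the same comparison applies with $\Phi = \sinh\chi$.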




2.5 Kinematic tests 


For an object at a cosmological redshift, it is desirable to measure its angular size 
(the angle the object subtends on the sky) or its apparent luminosity. Given a class 
of objects of the same size (standard rulers), we find that the corresponding angular 
size changes with redshift in a specific way that depends on the values of the 
cosmological parameters. The same is also true for the apparent luminosities of 
objects with the same total brightness (standard candles). Therefore, if we know 
the appropriate dependencies for particular classes of standard rulers or standard 
candles, we can determine the cosmological parameters. Moreover, because the 
measurements refer to earlier times when the universe was $1 + z$ times smaller
than now, we can study its recent expansion history and distinguish among models 
with different matter content. 


2.5.1 Angular diameter-redshift relation 


In a static, Euclidean space, the angle which an object with a given transverse
size subtends on the sky is inversely proportional to the distance to this object. In
an expanding universe, the relation between the distance and the angular size is
not so trivial. Let us consider some extended object of given transverse size $l$ at
comoving distance $\chi_{em}$ from an observer (Figure 2.11). Without loss of generality,
we can set $\varphi$ = const. Then, photons emitted by the endpoints of this object at time
$t_{em}$ propagate along radial geodesics and arrive today with an apparent angular
separation $\Delta\theta$. The proper size of the object, $l$, is equal to the interval between the
emission events at the endpoints:

$$l = \sqrt{-\Delta s^2} = a(t_{em})\, \Phi(\chi_{em})\, \Delta\theta, \tag{2.68}$$

as obtained from metric (2.2). The angle subtended by the object is then

$$\Delta\theta = \frac{l}{a(t_{em})\Phi(\chi_{em})} = \frac{l}{a(\eta_0 - \chi_{em})\Phi(\chi_{em})}, \tag{2.69}$$

where we have used the fact that the physical time $t_{em}$ corresponds to the conformal
time $\eta_{em} = \eta_0 - \chi_{em}$. If the object is close to us, that is, $\chi_{em} \ll \eta_0$, then

$$a(\eta_0 - \chi_{em}) \approx a(\eta_0), \qquad \Phi(\chi_{em}) \approx \chi_{em},$$

and

$$\Delta\theta \approx \frac{l}{a(\eta_0)\chi_{em}} = \frac{l}{D}.$$

We see that in this case $\Delta\theta$ is inversely proportional to the distance, as expected.
However, if the object is located far away, namely, close to the particle horizon,
then $\eta_0 - \chi_{em} \ll \eta_0$, and

$$a(\eta_0 - \chi_{em}) \ll a(\eta_0), \qquad \Phi(\chi_{em}) \to \Phi(\chi_p) = \text{const}.$$

The angular size of the object,

$$\Delta\theta \propto \frac{l}{a(\eta_0 - \chi_{em})},$$

increases with distance and as it approaches the horizon its image covers the whole
sky. Of course, the apparent luminosity drops drastically with increasing distance,
otherwise remote objects would completely outshine nearby ones.

To understand this unusual behavior of the angular diameter, it is again useful to 
turn to a low-dimensional analogy and consider how an observer on the north pole 
of the Earth would see an object of a given size at various distances. In this analogy, 
light propagates along meridians, which are geodesics on the Earth’s surface, and 
we find that the angular size decreases with distance, but only if the object is north 
of the equator. If the object is south of the equator, the angular size increases with 
distance until, finally, an object at the south pole “covers the whole sky.” This 
analogy, while illuminating, is not complete. The angular size of a very remote 
object also grows in a flat universe because of the time dependence of the scale 
factor; the 4-curvature of spacetime is responsible for the unusual behavior of the 
angular diameter. 

The angular size $\Delta\theta$ can be expressed as a function of redshift $z$. Since
$a_0/a(t_{em}) = 1 + z$, we can write (2.69) as

$$\Delta\theta(z) = (1 + z)\, \frac{l}{a_0 \Phi(\chi_{em}(z))}, \tag{2.70}$$

where $\chi_{em}(z)$ is given by (2.65). In a flat universe filled with dust, the function
$\Phi(\chi_{em})$ equals $\chi_{em}$, whose explicit dependence on $z$ was given in (2.66). Hence,
the angular diameter as a function of $z$ is

$$\Delta\theta(z) = \frac{l H_0}{2}\, \frac{(1 + z)^{3/2}}{(1 + z)^{1/2} - 1}. \tag{2.71}$$

At low redshifts ($z \ll 1$), the angular diameter decreases in inverse proportion to
$z$, reaches a minimum at $z = 5/4$, and then scales as $z$ for $z \gg 1$ (Figure 2.12).
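
The location of the minimum can be confirmed with a coarse scan of (2.71). In this sketch the angular size is measured in units of $l H_0$; the function name and the grid spacing are arbitrary illustrative choices.

```python
import math

def delta_theta_flat_dust(z, l_H0=1.0):
    """Angular size (2.71) in a flat dust universe, in units where l * H0 = l_H0."""
    s = math.sqrt(1.0 + z)
    return 0.5 * l_H0 * s**3 / (s - 1.0)

# Scan redshifts 0.01, 0.02, ..., 9.99 and pick the one with the smallest angular size.
zs = [0.01 * i for i in range(1, 1000)]
z_min = min(zs, key=delta_theta_flat_dust)
print(z_min, delta_theta_flat_dust(z_min))
```

Analytically, writing $x = \sqrt{1 + z}$ gives $\Delta\theta \propto x^3/(x - 1)$, whose derivative vanishes at $x = 3/2$, that is, at $z = 5/4$.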

The extension to more general cosmologies is straightforward. For example,
substituting $\Phi(\chi_{em})$ from (2.67) into (2.70), we find that in a nonflat dust-dominated
universe,

$$\Delta\theta(z) = \frac{l H_0}{2}\, \frac{\Omega_0^2 (1 + z)^2}{\Omega_0 z + (\Omega_0 - 2)\left((1 + \Omega_0 z)^{1/2} - 1\right)}. \tag{2.72}$$

In principle, having standard rulers distributed over a range of redshifts we could
use the measurements of angular diameter versus redshift to test cosmological
models. Unfortunately, the lack of reliable standard rulers has hampered progress
in this technique for many years.

One spectacular success, though, has been a single standard ruler extracted from
measurements of the cosmic microwave background. The temperature autocorrelation
function measures how the microwave background temperature in two
directions in the sky differs; this temperature difference depends on the angular
separation. The power spectrum is observed to have a series of peaks as the angular
separation is varied from large to small scales. The “first acoustic peak” is roughly
determined by the sound horizon at recombination, the maximum distance that a
sound wave in the baryon-radiation fluid can have propagated by recombination.
This sound horizon serves as a standard ruler of length $l_s \sim H^{-1}(z_r)$. Recombination
occurs at redshift $z_r \simeq 1100$. Since $\Omega_0 z_r \gg 1$, we can set $\chi_{em}(z_r) = \chi_p$ in
(2.70) and in a dust-dominated universe, where $\Phi(\chi_p) = 2(a_0 H_0 \Omega_0)^{-1}$ (see (2.9)),
we obtain

$$\Delta\theta_r \simeq \frac{z_r H_0 \Omega_0}{2H(z_r)} \simeq \frac{1}{2}\,z_r^{-1/2}\,\Omega_0^{1/2} \simeq 0.87^\circ\,\Omega_0^{1/2}. \tag{2.73}$$

We have substituted here $H_0/H(z_r) \simeq (\Omega_0 z_r^3)^{-1/2}$, as follows from (2.61). Note that
in Euclidean space, the corresponding angular size would be $\Delta\theta_r \simeq t_r/t_0 \sim z_r^{-3/2}$,
or about 1000 times smaller.
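
The numbers in (2.73) are easy to reproduce. The sketch below, with a hypothetical helper name and a default $z_r = 1100$ taken from the text, converts the estimate $\frac{1}{2} z_r^{-1/2}\Omega_0^{1/2}$ to degrees and compares it with the Euclidean scaling $z_r^{-3/2}$, dropping all factors of order unity.

```python
import math

def first_peak_angle_deg(omega0, z_r=1100.0):
    """Estimate (2.73): 0.5 * z_r**-0.5 * omega0**0.5, converted to degrees."""
    return math.degrees(0.5 * math.sqrt(omega0 / z_r))

for omega0 in (0.3, 1.0, 2.0):
    print(f"Omega_0 = {omega0}: {first_peak_angle_deg(omega0):.2f} deg")

# Euclidean comparison from the text: angle ~ z_r**-1.5 (radians).
z_r = 1100.0
ratio = 0.5 * z_r**-0.5 / z_r**-1.5
print(f"ratio to the Euclidean estimate: {ratio:.0f}")
```

With the numerical factor $1/2$ retained the ratio is $z_r/2$, which is of the same order of magnitude as the factor quoted in the text.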

The remarkable aspect of this result is that the angular diameter depends directly
only on $\Omega_0$, which determines the spatial curvature, and is not very sensitive to
other parameters. As we will see in Chapter 9, this is true not only for a dust-dominated
universe, as considered here, but for a very wide range of cosmological
models, containing multiple matter components. Hence, measuring the angular
scale of the first acoustic peak has emerged as the leading and most direct method
for determining the spatial curvature. Our best evidence that the universe is spatially
flat ($\Omega_0 = 1$), as predicted by inflation, comes from this test.


2.5.2 Luminosity-redshift relation


A second method of recovering the expansion history is with the help of the
luminosity-redshift relation. Let us consider a source of radiation with total luminosity
(energy per unit time) $L$ located at comoving distance $\chi_{em}$ from us. The
total energy released by the source at time $t_{em}$ within a conformal time interval $\Delta\eta$
is equal to

$$\Delta E_{em} = L\,\Delta t_{em}(\Delta\eta) = L\,a(t_{em})\,\Delta\eta. \tag{2.74}$$


All of the emitted photons are located within a shell of constant conformal width
$\Delta\chi = \Delta\eta$. The radius of this shell grows with time and the frequencies of the
photons are redshifted. Therefore, when these photons reach the observer at time
$t_0$, the total energy within the shell is

$$\Delta E_{obs} = \Delta E_{em}\,\frac{a(t_{em})}{a_0} = L\,\frac{a^2(t_{em})}{a_0}\,\Delta\eta. \tag{2.75}$$

At this moment, the shell has surface area

$$S_{sh}(t_0) = 4\pi a_0^2\,\Phi^2(\chi_{em})$$

and physical width

$$\Delta l_{sh} = a_0\,\Delta\chi = a_0\,\Delta\eta.$$

The shell passes the observer’s position over a time interval (measured by the
observer) $\Delta t_{sh} = \Delta l_{sh} = a_0\Delta\eta$. Therefore, the measured bolometric flux (energy
per unit area per unit time) is equal to

$$F = \frac{\Delta E_{obs}}{S_{sh}(t_0)\,\Delta t_{sh}} = \frac{L}{4\pi\,\Phi^2(\chi_{em})}\,\frac{a^2(t_{em})}{a_0^4}, \tag{2.76}$$

or, as a function of redshift,

$$F = \frac{L}{4\pi a_0^2\,\Phi^2(\chi_{em}(z))\,(1+z)^2}. \tag{2.77}$$

Here $\chi_{em}(z)$ is given by (2.65). Instead of $F$, astronomers often use the apparent
(bolometric) magnitude, $m_{bol}$, defined as

$$m_{bol}(z) \equiv -2.5\log_{10} F = 5\log_{10}(1+z) + 5\log_{10}\left(\Phi(\chi_{em}(z))\right) + \text{const}, \tag{2.78}$$

where const is $z$-independent.
For $z \ll 1$, we find that, irrespective of the spatial curvature and matter composition
of the universe,

$$m_{bol}(z) = 5\log_{10} z + \frac{2.5}{\ln 10}\,(1 - q_0)\,z + O(z^2) + \text{const}, \tag{2.79}$$

where $q_0 \equiv -\left(\ddot a/aH^2\right)_0$. In turn, the value of the deceleration parameter $q_0$ is
determined by the equation of state. Using Friedmann equation (1.66), we obtain

$$q_0 = \frac{1}{2}\,\Omega_0\left(1 + \frac{3p}{\varepsilon}\right)_0. \tag{2.80}$$

Thus, measuring the luminosity-redshift dependence for a set of standard candles,
we can, in principle, determine the effective equation of state for the dominant
matter components.

Measurements using Type Ia supernovae as standard candles have produced a
spectacular result. The expansion of the universe has been found to be accelerating,
rather than decelerating. In other words, $q_0$ is negative. In a matter-dominated
universe, the gravitational self-attraction of matter resists the expansion and slows
it down. According to Friedmann equation (1.66), acceleration is possible only if
a substantial fraction of the total energy density is a “dark energy” with negative
pressure or, equivalently, negative equation of state $w \equiv p/\varepsilon$.
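
To see how (2.80) produces a negative $q_0$, it helps to evaluate it for a mixture of components. The sketch below assumes, as an illustration beyond the single-fluid form of (2.80), that the combination $\Omega_0(1 + 3p/\varepsilon)_0$ can be written as a sum over components with density parameters $\Omega_i$ and constant equations of state $w_i$; the function name is hypothetical.

```python
def deceleration_parameter(components):
    """q0 from (2.80), summed over components given as (Omega_i, w_i) pairs."""
    return 0.5 * sum(omega * (1.0 + 3.0 * w) for omega, w in components)

print(deceleration_parameter([(1.0, 0.0)]))                # dust only: decelerating
print(deceleration_parameter([(0.3, 0.0), (0.7, -1.0)]))   # matter + Lambda: accelerating
print(deceleration_parameter([(0.3, 0.0), (0.7, -1.0 / 3.0)]))  # borderline dark energy
```

A component with $w = -1/3$ contributes nothing to $q_0$, which is why acceleration requires $w < -1/3$ for the dominant component.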

One possibility is that the dark energy component is a vacuum energy density
or cosmological constant, which corresponds to $w = -1$. Alternatively, the dark
energy can be dynamical, such as a slightly time-varying scalar field. The latter
case is referred to as “quintessence.” The discovery of cosmic acceleration raises a
number of new problems in cosmology. At present, there is no convincing explanation
as to why dark energy came to dominate so late in the history of the universe,
precisely at the epoch in which we observe it. Additionally, because the nature of the dark
energy is uncertain, the long-term future of the universe cannot be determined.
If the dark energy is a cosmological constant, then the acceleration will continue 
forever and the universe will become empty. On the other hand, if the dark energy 
is a dynamical scalar field, then this field may decay, repopulating the universe 
with matter and energy. In summary, dark energy is one of the most enigmatic and 
challenging issues in cosmology today. 

The supernovae that provide evidence for the dark energy component have redshifts
of order unity and the expansion in (2.79), valid only for z < 0.3, is not
applicable for them. Therefore, to describe the observations, we have to use the
exact formula (2.78) and choose a particular class of cosmological models in order
to compute $\Phi(\chi_{em}(z))$. For example, for a flat universe comprising only cold matter
and a cosmological constant, so that $\Omega_0 = \Omega_\Lambda + \Omega_m = 1$, we have

$$\Phi(\chi_{em}(z)) = \chi_{em}(z) = \frac{1}{H_0 a_0}\int_0^z \frac{d\tilde z}{\sqrt{\Omega_m(1+\tilde z)^3 + (1 - \Omega_m)}}. \tag{2.81}$$

Calculating the integral numerically, we can find $m_{bol}(z)$ for different values of $\Omega_m$
(Figure 2.13). The best fit to the data is achieved for $\Omega_m \simeq 0.3$.
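
The numerical integration mentioned above can be sketched in a few lines of Python. This is a minimal illustration, not the procedure used to produce Figure 2.13: it evaluates $a_0H_0\chi_{em}(z)$ from (2.81) with Simpson's rule (the step count is an arbitrary choice) and forms $m_{bol}$ from (2.78) up to the $z$-independent constant.

```python
import math

def chi_times_a0H0(z, omega_m, steps=2000):
    """a0*H0*chi_em(z) from (2.81), flat matter + Lambda universe (Simpson's rule)."""
    f = lambda x: 1.0 / math.sqrt(omega_m * (1.0 + x)**3 + (1.0 - omega_m))
    h = z / steps                       # steps must be even for Simpson's rule
    total = f(0.0) + f(z)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * f(k * h)
    return total * h / 3.0

def m_bol_minus_const(z, omega_m):
    """Apparent magnitude (2.78) up to the z-independent constant."""
    return 5.0 * math.log10(1.0 + z) + 5.0 * math.log10(chi_times_a0H0(z, omega_m))

# Magnitude difference between a Lambda-dominated and a pure-dust flat model.
for z in (0.1, 0.5, 1.0):
    dm = m_bol_minus_const(z, 0.3) - m_bol_minus_const(z, 1.0)
    print(f"z = {z}: m(Omega_m = 0.3) - m(Omega_m = 1) = {dm:+.2f} mag")
```

For $\Omega_m = 1$ the integral reduces to the closed form $2\left(1 - (1+z)^{-1/2}\right)$, which provides a check on the quadrature. The positive magnitude difference means that a standard candle at fixed redshift appears fainter in the model with a cosmological constant.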
Fig. 2.13.


Problem 2.15 In Euclidean space, the observed flux $F$ from an object of luminosity
$L$ at distance $d$ is $F = L/4\pi d^2$ and the angular size of an object of known length
$l$ is $\Delta\theta = l/d$. Based on these relations, cosmologists sometimes formally define
the luminosity distance $d_L$ and the angular diameter distance $d_A$ to an object in an
expanding universe as

$$d_L \equiv \left(\frac{L}{4\pi F}\right)^{1/2}, \qquad d_A \equiv \frac{l}{\Delta\theta}, \tag{2.82}$$

respectively. Calculate $d_L(z)$ and $d_A(z)$ in a dust-dominated universe. How are they
related in general? Verify that the distances $d_L$ and $d_A$ coincide only to leading
order in $z$ and at small $z$ revert to the Euclidean distance $d$. In contrast with $d_A$, the
luminosity distance $d_L$ increases with $z$ at large redshift, as common sense would
suggest. Both, however, are only formal for $z > 1$ where the notion of invariant
physical distance does not exist.


2.5.3 Number counts 


A further kinematic test is based on counting the number of cosmological objects
with a given redshift. Suppose the number of galaxies or clusters per unit volume at
a moment of time characterized by redshift $z$ is spatially uniform and equal to $n(z)$.
Then, the number of galaxies with redshifts between $z$ and $z + \Delta z$, and within a
solid angle $\Delta\Omega$, is

$$\Delta N = n(z)\,a^3(z)\,\Phi^2(\chi)\,\Delta\chi\,\Delta\Omega = n(z)\,(1+z)^{-3}\,a_0^2\,H^{-1}(z)\,\Phi^2(\chi)\,\Delta z\,\Delta\Omega, \tag{2.83}$$

where we have used the relation between $\Delta\chi$ and $\Delta z$ (see (2.65)). Substituting
$\Phi^2(\chi)$ from (2.67), we find that in a dust-dominated universe

$$\frac{\Delta N}{\Delta z\,\Delta\Omega} = \frac{4n(z)}{H_0^3\,\Omega_0^4}\,\frac{\left[\Omega_0 z + (\Omega_0 - 2)\left((1+\Omega_0 z)^{1/2} - 1\right)\right]^2}{(1+z)^6\,(1+\Omega_0 z)^{1/2}}. \tag{2.84}$$

If we know $n(z)$, then measurement of $\Delta N/\Delta z\,\Delta\Omega$ can be used as a test of cosmological
models. The difficulty in applying this method is that the number of
galaxies varies with redshift not only because of the expansion but also as a result
of dynamical evolution. For example, small galaxies merge to form large ones.
Conceivably, this problem can be avoided if the number density of some subset of
galaxies has predictable evolution.
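
As a consistency check on (2.84), the following sketch evaluates its right-hand side and confirms that at small redshift it approaches the Euclidean count $n z^2/H_0^3$, independently of $\Omega_0$. The function name and the choice of units ($n = H_0 = 1$) are assumptions made for illustration.

```python
import math

def counts_per_dz_domega(z, omega0, n=1.0, H0=1.0):
    """Right-hand side of (2.84) for a dust-dominated universe."""
    bracket = omega0 * z + (omega0 - 2.0) * (math.sqrt(1.0 + omega0 * z) - 1.0)
    return (4.0 * n * bracket**2
            / (H0**3 * omega0**4 * (1.0 + z)**6 * math.sqrt(1.0 + omega0 * z)))

# At small z the result should approach the Euclidean value n * z**2 / H0**3.
z = 1.0e-3
for omega0 in (0.3, 1.0):
    print(omega0, counts_per_dz_domega(z, omega0) / z**2)   # both close to 1
```

The dependence on $\Omega_0$ enters only at higher order in $z$, which is why number counts must reach redshifts of order unity to discriminate between models.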


2.5.4 Redshift evolution 


The redshift of a given object drifts slowly with time due to the acceleration (or 
deceleration) of the universe. The effect is so small that it is not possible to measure 
it using today’s technology. However, we introduce the concept as an example of a 
measurement that could be possible in the coming decades. 

Light from a source located at comoving distance $\chi$ that we observe today at
conformal time $\eta_0$ was emitted at conformal time $\eta_e = \eta_0 - \chi$. The appropriate
redshift depends on $\eta_0$ and is equal to

$$1 + z = \frac{a(\eta_0)}{a(\eta_0 - \chi)}. \tag{2.85}$$

This redshift depends on the time of observation $\eta_0$ and since $\chi$ is constant, its time
derivative (with $dt_0 = a_0\,d\eta_0$) is

$$\dot z \equiv \frac{\partial z}{\partial t_0} = (1+z)H_0 - H(z). \tag{2.86}$$

Taking into account that $\varepsilon(z) = \varepsilon_0^{cr}\left(\Omega_\Lambda + \Omega_m(1+z)^3\right)$ in a universe with a mixture
of matter and vacuum density, and using this expression in (2.61) for $H(z)$, we obtain

$$\dot z = (1+z)H_0\left\{1 - \left[1 - \Omega_0 + \Omega_m(1+z) + \Omega_\Lambda(1+z)^{-2}\right]^{1/2}\right\}. \tag{2.87}$$

In a flat universe, where $\Omega_0 = 1$ and $\Omega_\Lambda = 1 - \Omega_m$, the redshift drift, $\Delta v \equiv
\Delta z/(1+z)$, is equal to

$$\Delta v = \frac{\dot z\,\Delta t}{1+z} = -H_0\Delta t\left\{\left[\Omega_m(1+z) + (1 - \Omega_m)(1+z)^{-2}\right]^{1/2} - 1\right\}. \tag{2.88}$$

The drift is negative for a matter-dominated universe ($\Omega_m \to 1$) and positive
if the cosmological constant dominates ($\Omega_m \to 0$). For $\Omega_m = 1$ and $\Omega_\Lambda = 0$, its
magnitude is

$$\Delta v \simeq -2\left(\sqrt{1+z} - 1\right)\ \text{cm s}^{-1}$$


for observations made a period $\Delta t = 1$ year apart. Although the velocity shift is
tiny and beyond current detection capabilities, redshift is one of the most precisely
measured physical observables. Current technology would enable measurements
of shifts of perhaps 10 m s$^{-1}$ per year. The required improvement by a few orders
of magnitude in the next few decades is conceivable. Such a measurement
would represent a direct detection of acceleration which would complement the
luminosity-redshift tests.
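
To get a feeling for the size and sign of the effect, (2.88) can be evaluated directly. The sketch below restores the factor of $c$ to express the drift as a velocity and assumes $H_0 = 70$ km s$^{-1}$ Mpc$^{-1}$; this value and the unit-conversion constants are illustrative inputs, not taken from the text.

```python
import math

H0_PER_SEC = 70.0 * 1.0e5 / 3.0857e24   # assumed H0 = 70 km/s/Mpc, converted to 1/s
C_CM_PER_SEC = 2.998e10                  # speed of light in cm/s
YEAR_SEC = 3.156e7                       # one year in seconds

def drift_cm_per_s(z, omega_m, dt_years=1.0):
    """Velocity shift (2.88), multiplied by c, for a flat matter + Lambda model."""
    e = math.sqrt(omega_m * (1.0 + z) + (1.0 - omega_m) / (1.0 + z)**2)
    return -C_CM_PER_SEC * H0_PER_SEC * dt_years * YEAR_SEC * (e - 1.0)

for omega_m in (1.0, 0.3):
    for z in (0.5, 1.0, 3.0):
        print(f"Omega_m = {omega_m}, z = {z}: {drift_cm_per_s(z, omega_m):+.2f} cm/s per year")
```

For $\Omega_m = 1$ the prefactor $cH_0\Delta t$ comes out close to 2 cm s$^{-1}$ for one year, in line with the estimate quoted above. For $\Omega_m = 0.3$ the drift is positive at low redshift and changes sign at higher redshift, where the source was emitted during the decelerating, matter-dominated stage.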


3 


The hot universe 


In the previous chapters we studied the geometrical properties of the universe. 
Now we turn to its thermal history. This history can be subdivided into several 
periods. Here we focus mainly on the period between neutrino decoupling and 
recombination. This period is characterized by a sequence of important departures 
from thermal and chemical equilibrium that shaped the present state of the universe. 

We begin with an overview of the main thermal events and then turn to their 
detailed description. In particular, in this chapter we study the decoupling of neu- 
trinos, primordial nucleosynthesis and recombination. Our considerations are based 
on well understood and tested laws of particle, nuclear and atomic physics below a 
few MeV and, as such, are not likely to be a rich source of future research. How- 
ever, this is important background material which underlies the concept of the hot 
expanding universe. 


3.1 The composition of the universe 


According to the Friedmann equations, the expansion rate of the universe is de- 
termined by the energy density and equation of state of its constituents. The main 
components of the matter composition that played an important role at tempera- 
tures below a few MeV are primordial radiation, baryons, electrons, neutrinos, dark 
matter and dark energy. 

Primordial radiation The cosmic microwave background (CMB) radiation has
temperature $T_{\gamma 0} \simeq 2.73$ K. Its current energy density is about $\varepsilon_{\gamma 0} \simeq 10^{-34}$ g cm$^{-3}$
and constitutes only $10^{-5}$ of the total energy density. The radiation has a perfect
Planckian spectrum and appears to have been present in the very early universe
at energies well above a GeV. Since the temperature of radiation scales in inverse
proportion to the scale factor, it must have been very high in the past.

Baryonic matter This is the material out of which the planets, stars, clouds of
gas and possibly “dark” stars of low mass are made; some of it could also form
black holes. We will see later that the data on light element abundances and CMB
fluctuations clearly indicate that the baryonic component contributes only a small
percentage of the critical energy density ($\Omega_b \simeq 0.04$). The number of photons per
baryon is of order $10^9$.

Dark matter and dark energy The CMB fluctuations imply that at present 
the total energy density is equal to the critical density. This means that the largest 
fraction of the energy density of the universe is dark and nonbaryonic. It is not quite 
clear what constitutes this dark component. Combining the data on CMB, large scale 
structure, gravitational lensing and high-redshift supernovae it appears that the dark 
component is a mixture of two or more constituents. More precisely, it is composed 
of cold dark matter and dark energy. The cold dark matter has zero pressure and can 
cluster, contributing to gravitational instability. Various (supersymmetric) particle 
theories provide us with natural candidates for the cold dark matter, among which 
weakly interacting massive particles are most favored at present. The nonbaryonic 
cold dark matter contributes only about 25% of the critical density. The remaining
70% of the missing density comes in the form of nonclustered dark energy with
negative pressure. It may be either a cosmological constant ($p_\Lambda = -\varepsilon_\Lambda$) or a scalar
field (quintessence) with $p = w\varepsilon$, where $w$ is less than $-1/3$ today.

Primordial neutrinos These are an inevitable remnant of a hot universe. If the 
three known neutrino species were massless, their temperature today would be
$T_\nu \simeq 1.9$ K and they would contribute 0.68 times the radiation density (see Section
3.4.2). Atmospheric neutrino oscillation experiments suggest that the neutrinos 
have small masses. Even so, it appears that they cannot constitute more than 1% of 
the critical density. 

The universe was hotter and denser in the past. The energy densities of radiation,
cold matter and dark energy scale with redshift $z$ as

$$\varepsilon_\gamma = \varepsilon_{\gamma 0}(1+z)^4, \qquad \varepsilon_m = \varepsilon_0^{cr}\,\Omega_m(1+z)^3, \qquad \varepsilon_Q = \varepsilon_0^{cr}\,\Omega_Q(1+z)^{3(1+w)}, \tag{3.1}$$

respectively. Here $\varepsilon_0^{cr} = 3H_0^2/8\pi G$ is the critical density today, $\Omega_m$ is the total
contribution of baryons and cold dark matter to the current cosmological parameter
and $\Omega_Q$ is the contribution of dark energy. When we go back in time the dark energy
density grows the least quickly; its impact on the dynamics of the universe becomes
less than that of cold matter at redshift (see Figure 3.1)

$$z_Q = \left(\Omega_Q/\Omega_m\right)^{-1/(3w)} - 1. \tag{3.2}$$

This occurs close to the present time, at $z_Q \simeq 0.33$ to $1.33$, for $-1 < w < -1/3$,
$\Omega_m \simeq 0.3$ and $\Omega_Q \simeq 0.7$.
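
The quoted range of $z_Q$ follows directly from (3.2). A minimal sketch, with a hypothetical function name and the values $\Omega_m = 0.3$, $\Omega_Q = 0.7$ used in the text:

```python
def z_dark_energy_equality(omega_q, omega_m, w):
    """Redshift (3.2) at which dark energy and cold matter densities are equal."""
    return (omega_q / omega_m) ** (-1.0 / (3.0 * w)) - 1.0

for w in (-1.0, -2.0 / 3.0, -1.0 / 3.0):
    print(f"w = {w:+.2f}: z_Q = {z_dark_energy_equality(0.7, 0.3, w):.2f}")
```

The two ends of the interval, $w = -1$ and $w = -1/3$, reproduce $z_Q \simeq 0.33$ and $z_Q \simeq 1.33$ respectively.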


Problem 3.1 Find the value of $z$ at which the accelerated expansion begins.

Fig. 3.1.


The radiation energy density grows faster than the density of cold matter and
eventually becomes dominant at redshift

$$z_{eq} = \frac{\varepsilon_0^{cr}\,\Omega_m}{\varepsilon_{\gamma 0}} - 1 \simeq 2.26\times 10^4\,\Omega_m h_{75}^2. \tag{3.3}$$

Thus, we can distinguish three dynamically different stages in the expansion
history:

• the radiation-dominated epoch at $z > z_{eq} \sim 10^4$, where the universe is dominated by
ultra-relativistic matter with $p = \varepsilon/3$ and the scale factor increases as $a \propto t^{1/2}$;

• the matter-dominated epoch at $z_{eq} > z > z_Q$, where the pressureless components determine
the expansion rate and $a \propto t^{2/3}$;

• the dark-energy-dominated epoch at $z < z_Q$, where the component with negative pressure,
$p = w\varepsilon$, leads to an accelerated expansion and $a \propto t^{2/3(1+w)}$.


Note that the dark energy cannot begin to dominate too early because a substantial 
period of matter domination is needed for structure formation. In fact, it becomes 
relevant exactly at the present time. This astounding cosmic coincidence is one of 
the greatest mysteries of contemporary cosmology. 


Problem 3.2 How do ultra-relativistic neutrinos influence an estimate for the redshift
at which the ultra-relativistic matter begins to dominate?

Problem 3.3 Dark energy with equation of state $w = -1/3$ leads to a term $\propto 1/a^2$
in the Friedmann equation (1.67). How can we nevertheless distinguish it from the
spatial curvature term, $k/a^2$, in an open universe?


3.2 Brief thermal history 


The temperature of the cosmic radiation decreases as the universe expands. It is
unambiguously related to the redshift,

$$T_\gamma(z) = T_{\gamma 0}(1+z), \tag{3.4}$$

and can be used as an alternative to time or redshift to parameterize the history of
the universe. To obtain an estimate for the temperature expressed in MeV, at the
time $t$ measured in seconds, we can use the formula

$$T_{MeV} \simeq \frac{O(1)}{\sqrt{t_{sec}}},$$

which is valid during the radiation-dominated epoch (see Section 3.4.2).
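
The rule of thumb relating temperature and time is simple enough to tabulate. The sketch below takes the $O(1)$ prefactor equal to one, which is an assumption made purely for illustration, so the outputs should be read as order-of-magnitude estimates only.

```python
import math

def temperature_mev(t_sec, prefactor=1.0):
    """Order-of-magnitude estimate T[MeV] ~ prefactor / sqrt(t[s]); prefactor is O(1)."""
    return prefactor / math.sqrt(t_sec)

for t in (0.2, 1.0, 250.0):
    print(f"t = {t:g} s  ->  T ~ {temperature_mev(t):.2f} MeV")
```

The estimate applies only while radiation dominates, so it should not be extrapolated to times much later than matter-radiation equality.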
Below we briefly summarize the sequence of main events constituting the history
of our universe (in reverse chronological order):

• ~$10^{16}$–$10^{17}$ s Galaxies and their clusters are formed from small initial inhomogeneities as
a result of gravitational instability. Structure formation can be described using Newtonian
gravity. However, it is still a very complicated nonlinear problem, which can only be
solved numerically and it is likely to remain an active field of research for a long time.
One of the main unresolved fundamental issues regarding this period is the nature of dark
matter and dark energy.

• ~$10^{12}$–$10^{13}$ s At this time nearly all free electrons and protons recombine and form neutral
hydrogen. The universe becomes transparent to the background radiation. The CMB
temperature fluctuations, induced by the slightly inhomogeneous matter distribution at
recombination, survive to the present day and deliver direct information about the state
of the universe at the last scattering surface. Helium, which constitutes about 25% of the
baryonic matter, has recombined and become neutral before this time. After helium recombination
there remain many free electrons and the universe is still opaque to radiation.
Helium recombination, therefore, is not a very dramatic event, though we must take it
properly into account when calculating the microwave background fluctuations because
it influences the speed of sound.

• ~$10^{11}$ s ($T \sim$ eV) This time corresponds to matter-radiation equality which separates
the radiation-dominated epoch from the matter-dominated epoch. The exact value of the
cosmological time at equality depends on the constituents of the dark component and,
therefore, is known at present only up to a numerical factor of order unity.

• ~200–300 s ($T \sim 0.05$ MeV) Nuclear reactions become efficient at this temperature. As
a result, free protons and neutrons form helium and other light elements. The abundances
of the light elements resulting from primordial nucleosynthesis are in very good agreement
with available observation data and this strongly supports our understanding of the
universe’s evolution back to the first second after the big bang.

• ~1 s ($T \sim 0.5$ MeV) The typical energy at this time is of order the electron mass. The
numerous electron-positron pairs present in the very early universe begin to annihilate
when the temperature drops below their rest mass and only a small excess of electrons
over positrons, roughly one per billion photons, survives after annihilation. The photons
produced are in thermal equilibrium and the radiation temperature increases compared to
the temperature of neutrinos, which decoupled earlier.

• ~0.2 s ($T \sim 1$–2 MeV) Two important events take place during this period as certain
weak interaction processes fall out of equilibrium. First, the primordial neutrinos decouple
from the other particles and propagate without further scatterings. Second, the ratio of
neutrons to protons “freezes out” because the interactions that keep neutrons and protons
in chemical equilibrium become inefficient. Subsequently, the number of the surviving
neutrons determines the abundances of the primordial elements.

• ~$10^{-5}$ s ($T \sim 200$ MeV) The quark-gluon transition takes place: free quarks and gluons
become confined within baryons and mesons. The physics of the quark-gluon transition
is not yet completely understood, though it is unlikely that this transition leaves any
significant cosmological imprints.

• ~$10^{-10}$–$10^{-14}$ s ($T \sim 100$ GeV–10 TeV) This range of energy scales can still be probed
by accelerators. The Standard Model of electroweak and strong interactions appears to
be applicable here. We expect that at temperatures above ~100 GeV the electroweak
symmetry is restored and the gauge bosons are massless. Fermion and baryon numbers
are strongly violated in topological transitions above the symmetry restoration scale.

• ~$10^{-14}$–$10^{-43}$ s (10 TeV–$10^{19}$ GeV) This energy range will probably not be reached by
accelerators in the near future. Instead, the very early universe becomes, in Zel’dovich’s
words, “an accelerator for poor people” that can give us some rough information about
fundamental physics. There is no reason to expect that nonperturbative quantum gravity
plays any significant role below $10^{19}$ GeV. Therefore, we can still use General Relativity to
describe the dynamics of the universe. The main uncertainty here is the matter composition
of the universe. It might be that there are many more particle species than are evident
today. For example, according to supersymmetry, the number of particle species must be
doubled at least. Supersymmetry also provides us with good weakly interacting massive
particle candidates for dark matter.

The origin of baryon asymmetry in the universe is also related to physics beyond
the Standard Model. There are good reasons to expect that a Grand Unification of the
electroweak and strong interactions takes place at energies about $10^{16}$ GeV. Topological
defects, such as cosmic strings and monopoles, which occur naturally in unified theories,
might play some role in the early universe, though, according to the current microwave
background anisotropy data, it is unlikely that they have any significance for large scale
structure.

Perhaps the most interesting phenomenon in the above energy range is the accelerated
expansion of the universe, known as inflation, which probably occurs somewhere near Grand
Unification scales. It is remarkable and fortunate that the most important robust predictions
of inflation do not depend substantially on unknown particle physics. Therefore, the
existence of such a stage may be observationally verified in the near future.

• ~$10^{-43}$ s ($10^{19}$ GeV) Near the Planckian scale, nonperturbative quantum gravity dominates
and general relativity can no longer be trusted. However, at energies slightly below
this scale, classical spacetime still makes sense and we expect that the universe is in a self-reproducing
phase. Nevertheless, self-reproduction does not eliminate the fundamental
issues of spacetime structure at the Planckian scale. In particular, the question of cosmic
singularities still remains. It is expected that these problems will be properly addressed
in an as yet unknown nonperturbative string/quantum gravity theory.


3.3 Rudiments of thermodynamics 


To properly describe the physical processes in an expanding universe we need, 
strictly speaking, a full kinetic theory. Fortunately, the situation greatly simplifies 
in the very early universe, when the particles are in a state of local equilibrium
with each other. We would like to stress that the universe cannot be treated as a 
usual thermodynamical system in equilibrium with an infinite thermal bath of given 
temperature: it is a nonequilibrium system. Therefore, by local equilibrium we 
simply mean that matter has maximal possible entropy. The entropy is well defined 
for any system even if this system is far from equilibrium and never decreases. 
Therefore, if within a typical cosmological time the particles scatter from each 
other many times, their entropy reaches the maximal possible value before the size 
of the universe changes significantly. 
The reaction rate responsible for establishing equilibrium can be characterized
by the collision time:

$$t_c \simeq \frac{1}{\sigma n v}, \tag{3.5}$$

where $\sigma$ is the effective cross-section, $n$ is the number density of the particles
and $v$ is their relative velocity. This time should be compared to the cosmic time,
$t_H \sim 1/H$, and if

$$t_c \ll t_H, \tag{3.6}$$

local equilibrium is reached before expansion becomes relevant. Let us show that
at temperatures above a few hundred GeV condition (3.6) is satisfied for both
electroweak and strong interactions. At such high temperatures, all known particles
are ultra-relativistic and the gauge bosons are all massless. Therefore, the cross-sections
for strong and electroweak interactions have a similar energy dependence
and they can be estimated (e.g. on dimensional grounds) as

$$\sigma \simeq O(1)\,\alpha^2\lambda^2 \sim \frac{\alpha^2}{T^2}, \tag{3.7}$$

where $\lambda \sim 1/p$ is the de Broglie wavelength and $p = E \sim T$ is the typical momentum
of the colliding ultra-relativistic particles. The corresponding dimensionless
running coupling constants $\alpha$ vary only logarithmically with energy and are of order
$10^{-1}$–$10^{-2}$. Taking into account that the number density of the ultra-relativistic
species is $n \sim T^3$, we find that

$$t_c \sim \frac{1}{\alpha^2 T}. \tag{3.8}$$

Comparing this time to the Hubble time (in Planckian units),

$$t_H \sim \frac{1}{H} \sim \frac{1}{\sqrt{\varepsilon}} \sim \frac{1}{T^2}, \tag{3.9}$$

we find that at temperatures below $T \sim O(1)\,\alpha^2 \simeq 10^{15}$–$10^{17}$ GeV, but above a
few hundred GeV (where (3.7) is applicable), (3.6) is satisfied and the electroweak
as well as the strong interactions are efficient in establishing equilibrium between
quarks, leptons and intermediate bosons.
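
The comparison of (3.8) with (3.9) amounts to one line of arithmetic once the Planck mass is restored. In the sketch below the value $1.22\times 10^{19}$ GeV for the Planck mass and the neglect of all $O(1)$ factors are assumptions of the illustration; the function name is hypothetical.

```python
M_PLANCK_GEV = 1.22e19   # Planck mass; converts Planckian units of temperature to GeV

def equilibrium_ratio(T_gev, alpha):
    """t_c / t_H ~ T / (alpha**2 * M_Pl) from (3.8) and (3.9), O(1) factors dropped."""
    return T_gev / (alpha**2 * M_PLANCK_GEV)

# Temperature below which t_c << t_H, for the two ends of the coupling range.
for alpha in (1e-1, 1e-2):
    print(f"alpha = {alpha:g}: equilibrium for T << {alpha**2 * M_PLANCK_GEV:.1e} GeV")

# At T = 1 TeV the collision time is many orders of magnitude below the Hubble time.
print(equilibrium_ratio(1.0e3, 0.1))
```

The two couplings bracket the range $10^{15}$–$10^{17}$ GeV quoted above.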

The discerning reader might question whether one can apply the formulae for 
cross-sections derived in empty space to interactions which occur in extremely 
dense “plasma”. To get an idea of the strength of the plasma effects, we have to
compare the typical distance between the particles $1/n^{1/3} \sim 1/T$ to the “size”
of the particles $\sqrt{\sigma} \sim \alpha/T$. If the coupling constant $\alpha$ is smaller than unity, the
plasma effects are not very relevant.

Primordial gravitons and, possibly, other hypothetical particles that interact 
through the dimensionful gravitational constant already decouple from the rest 
of matter at Planckian times and propagate, subsequently, freely. 

Below 100 GeV, the $Z$ and $W^\pm$ bosons acquire mass ($M_W \simeq 80.4$ GeV,
$M_Z \simeq 91.2$ GeV) and, thereafter, the cross-sections of the weak interactions begin
to decrease as the temperature drops. As a result, the neutrinos decouple from the 
rest of matter. Finally the electromagnetic interactions also become inefficient and 
photons propagate freely. All these processes will be analyzed in detail later in the 
chapter, but first we would like to concentrate on the very early stages when known 
particles were in equilibrium with radiation and with each other. In this case, matter 
can be described in a very simple way: all particles are completely characterized 
by their temperature and corresponding chemical potential. 


3.3.1 Maximal entropy state, thermal spectrum, conservation 
laws and chemical potentials 


In this section, we outline an elegant derivation of the main formulae describing the
maximal entropy state. This derivation is based entirely on the notion of entropy for
a closed system and does not use any concepts from equilibrium thermodynamics.
Therefore, it can also be applied to the expanding universe.

Let us assume that all possible states of some (complicated) closed system can
be completely characterized and enumerated by a (composite) discrete variable
$\alpha$; different $\alpha$ correspond to microscopically different states. If we know that the
system is in a certain state $\alpha$, the information about this system is complete and
its entropy should be zero. This follows from the general definition according to
which the entropy characterizes the missing information. If, on the other hand, we
know only the probability $P_\alpha$ of finding the system in state $\alpha$, then the associated
(nonequilibrium) entropy is

$$S = -\sum_\alpha P_\alpha \ln P_\alpha. \tag{3.10}$$

It takes its maximum value when all states are equally probable, that is, $P_\alpha = 1/\Gamma$,
and is equal to

$$S = \ln\Gamma, \tag{3.11}$$

where $\Gamma$ is the total number of possible microstates which the system can occupy.
Note that the last expression gives a finite result only if the total energy is bounded,
otherwise the number of possible states would be infinite.
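
The two limiting statements about (3.10), zero entropy for a known state and $S = \ln\Gamma$ for equal probabilities, can be checked on a small example. The probability lists below are arbitrary illustrations.

```python
import math

def entropy(probabilities):
    """Nonequilibrium entropy (3.10): S = -sum P ln P (terms with P = 0 contribute 0)."""
    return sum(-p * math.log(p) for p in probabilities if p > 0.0)

gamma = 8                                   # number of microstates in this toy example
uniform = [1.0 / gamma] * gamma
skewed = [0.5, 0.2, 0.1, 0.1, 0.05, 0.03, 0.01, 0.01]
certain = [1.0] + [0.0] * (gamma - 1)

print(entropy(uniform), math.log(gamma))    # equal: the maximum value (3.11)
print(entropy(skewed))                      # smaller than ln(gamma)
print(entropy(certain))                     # zero: complete information
```

Any distribution other than the uniform one gives an entropy strictly below $\ln\Gamma$.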

Let us calculate the maximal possible entropy of an ideal gas of $N$ bose particles
with total energy $E$ placed in a box of volume $V$. It is clear that maximal entropy
occurs when there are no preferable directions and locations within the box. Therefore,
given the total energy and number of particles, each state of the system is
essentially given by the number of particles per mode of the one-particle energy
spectrum. Let us denote by $\Delta N_\epsilon$ the total number of particles, each having energy
in the interval between $\epsilon$ and $\epsilon + \Delta\epsilon$, and by $\Delta g_\epsilon$ the total number of different
possible microstates that a particle could occupy in the one-particle phase space.
The total number of all possible configurations (microstates) for $\Delta N_\epsilon$ bose particles
is equal to the number of ways of redistributing $\Delta N_\epsilon$ particles among $\Delta g_\epsilon$ cells
(Figure 3.2):

$$\Delta G_\epsilon = \frac{(\Delta N_\epsilon + \Delta g_\epsilon - 1)!}{(\Delta N_\epsilon)!\,(\Delta g_\epsilon - 1)!}. \tag{3.12}$$

Fig. 3.2.

The total number of states for the whole system, therefore, is

$$\Gamma(\{\Delta N_\epsilon\}) = \prod_\epsilon \Delta G_\epsilon. \tag{3.13}$$
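
The combinatorial factor (3.12) can be verified by brute force for small numbers. The sketch below counts all ways of assigning occupation numbers to the cells such that they sum to the number of particles, and compares the result with the closed form. Both function names are illustrative.

```python
from itertools import product
from math import comb

def count_bose_configurations(n_particles, n_cells):
    """Closed form (3.12): (N + g - 1)! / (N! (g - 1)!)."""
    return comb(n_particles + n_cells - 1, n_particles)

def brute_force_count(n_particles, n_cells):
    """Count occupation-number tuples (k_1, ..., k_g) with k_1 + ... + k_g = N."""
    return sum(1 for occ in product(range(n_particles + 1), repeat=n_cells)
               if sum(occ) == n_particles)

for n, g in [(2, 3), (4, 3), (3, 5)]:
    print(n, g, count_bose_configurations(n, g), brute_force_count(n, g))
```

Because bose particles are indistinguishable, a configuration is fully specified by the occupation numbers of the cells, which is exactly what the brute-force enumeration counts.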


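Substituting (3.13) into (3.11), we find that the maximal possible entropy of the
system with the given energy spectrum $\{\Delta N_\epsilon\}$ is

$$S(\{\Delta N_\epsilon\}) = \sum_\epsilon \ln\Delta G_\epsilon. \tag{3.14}$$

Let us assume that $\Delta N_\epsilon$ and $\Delta g_\epsilon$ are much larger than unity. Using Stirling’s
formula,

$$\ln N! = \sum_{n=1}^{N}\ln n \approx \int_1^N \ln x\,dx + \frac{1}{2}\ln N \approx \left(N + \frac{1}{2}\right)\ln N - N, \tag{3.15}$$

we find from (3.12) and (3.14) that, to leading order,

$$S(\{\Delta N_\epsilon\}) = S(\{n_\epsilon\}) = \sum_\epsilon\left[(n_\epsilon + 1)\ln(1 + n_\epsilon) - n_\epsilon\ln n_\epsilon\right]\Delta g_\epsilon, \tag{3.16}$$

where $n_\epsilon \equiv \Delta N_\epsilon/\Delta g_\epsilon$ are called occupation numbers. They characterize the average
number of particles per microstate of a single particle. The entropy depends
on the energy spectrum $\{n_\epsilon\}$ and we want to maximize it subject to the given total
energy

$$E(\{n_\epsilon\}) = \sum_\epsilon \epsilon\,\Delta N_\epsilon = \sum_\epsilon \epsilon\,n_\epsilon\,\Delta g_\epsilon, \tag{3.17}$$

and total number of particles

$$N(\{n_\epsilon\}) = \sum_\epsilon \Delta N_\epsilon = \sum_\epsilon n_\epsilon\,\Delta g_\epsilon. \tag{3.18}$$

To extremize (3.16) with the two extra constraints (3.17) and (3.18), we apply the
method of Lagrange multipliers. The variation of expression

$$S(\{n_\epsilon\}) + \lambda_1 E(\{n_\epsilon\}) + \lambda_2 N(\{n_\epsilon\})$$

with respect to $n_\epsilon$ vanishes for

$$n_\epsilon = \frac{1}{\exp(-\lambda_1\epsilon - \lambda_2) - 1}. \tag{3.19}$$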

Given spectrum (3.19), the Lagrange multipliers $\lambda_1$ and $\lambda_2$ are the parameters
which allow us to satisfy the constraints. They can be expressed in terms
of $E$ and $N$, or, instead, in terms of temperature $T \equiv -1/\lambda_1$ and chemical potential
$\mu \equiv \lambda_2 T$ ($k_B = 1$). The distribution function (3.19) then takes the form

$$n_\epsilon = \frac{1}{\exp((\epsilon - \mu)/T) - 1}. \tag{3.20}$$

This spectrum describes bose particles in a state of maximal possible entropy and is
known as the Bose-Einstein distribution. A similar derivation can be carried out for
fermi particles, the only difference being that we have to take into account the Pauli
exclusion principle, which forbids two fermions from simultaneously occupying
the same microstate.
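
Two steps of the derivation above lend themselves to a numerical check: the accuracy of the leading-order entropy (3.16) compared with the exact logarithm of (3.12), and the fact that the Bose-Einstein form (3.20) satisfies the stationarity condition $\ln\left((1 + n_\epsilon)/n_\epsilon\right) = (\epsilon - \mu)/T$ that follows from varying $S + \lambda_1 E + \lambda_2 N$. The sketch below uses arbitrary illustrative values for $\Delta N_\epsilon$, $\Delta g_\epsilon$, $\epsilon$, $T$ and $\mu$.

```python
import math

def ln_configurations_exact(n_particles, n_cells):
    """ln of (3.12), evaluated with log-gamma: ln x! = lgamma(x + 1)."""
    return (math.lgamma(n_particles + n_cells)
            - math.lgamma(n_particles + 1) - math.lgamma(n_cells))

def entropy_leading_order(n_particles, n_cells):
    """One term of (3.16): [(n + 1) ln(1 + n) - n ln n] * g with n = N / g."""
    n = n_particles / n_cells
    return ((n + 1.0) * math.log(1.0 + n) - n * math.log(n)) * n_cells

def bose_einstein(eps, T, mu):
    """Occupation number (3.20), with k_B = 1."""
    return 1.0 / math.expm1((eps - mu) / T)

# Stirling check: the two numbers agree up to corrections logarithmic in N and g.
N, g = 3000, 2000
print(ln_configurations_exact(N, g), entropy_leading_order(N, g))

# Stationarity check for the Bose-Einstein occupation number.
eps, T, mu = 2.0, 1.5, -0.5
n = bose_einstein(eps, T, mu)
print(math.log((1.0 + n) / n), (eps - mu) / T)
```

The residual difference in the first comparison comes from the subleading terms of Stirling's formula, which are dropped in (3.16).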


Problem 3.4 Derive the following expression for the entropy of fermi particles:

$$S(\{n_\epsilon\}) = \sum_\epsilon\left[(n_\epsilon - 1)\ln(1 - n_\epsilon) - n_\epsilon\ln n_\epsilon\right]\Delta g_\epsilon, \tag{3.21}$$

and show that it takes its maximal value for

$$n_\epsilon = \frac{1}{\exp((\epsilon - \mu)/T) + 1}. \tag{3.22}$$

Problem 3.5 According to (3.20) and (3.22), the energy of a single particle $\epsilon$ can, in
principle, be larger than the total energy of the whole system $E$, which contradicts
our assumptions. Where does the above derivation fail for $\epsilon$ comparable to or larger
than $E$?


In quantum field theory particles can be created and annihilated, so their total 
number is generally not conserved. In this case the number of particles in equilib- 
rium is determined solely by the requirement of maximal entropy for a given total 
energy. This removes the need to satisfy the second constraint (3.18). If there are 
no other constraints enforced by conservation laws, then the chemical potential $\mu$
is zero and there remains only one free parameter, $\lambda_1$, to fix the total energy. For
example, the total number and the temperature of photons are entirely determined 
by their total energy E. 

Because of the conservation of electric charge, electrons and positrons can be
produced only in pairs. Therefore, the difference between the numbers of electrons
and positrons, $N_{e^-} - N_{e^+}$, does not change. With this extra constraint, the Lagrange
variational principle takes the form

$$ \delta\left[ S(\{n_\epsilon\}) + S(\{\bar n_\epsilon\}) + \lambda_1 (E_{e^-} + E_{e^+}) + \lambda_2 (N_{e^-} - N_{e^+}) \right] = 0, \tag{3.23} $$

where we vary separately with respect to $n_\epsilon$ and $\bar n_\epsilon$. It is not hard to show that
the variation vanishes only if the electrons and positrons both satisfy the Fermi
distribution (3.22) with $T = -1/\lambda_1$ and $\mu_{e^-} = -\mu_{e^+} = T\lambda_2$. Thus, the chemical
potentials of the electrons and positrons are equal in magnitude and have opposite
signs as a consequence of electric charge conservation. They vanish only if the total
electric charge of the electron-positron plasma is equal to zero.


Problem 3.6 Assume particles of types A, B, C, D are in equilibrium with each
other due to the reaction

$$ A + B \rightleftharpoons C + D. $$

It is easy to see that the following combinations are conserved: $N_A + N_C$, $N_A + N_D$,
$N_B + N_C$, $N_B + N_D$. Using this fact, show that the chemical potentials satisfy the
relation

$$ \mu_A + \mu_B = \mu_C + \mu_D. \tag{3.24} $$

Note that, if electrons and positrons are in equilibrium with each other and with
radiation due to the interaction $e^- + e^+ \rightleftharpoons \gamma + \gamma$, then from (3.24) we recover the
result $\mu_{e^-} = -\mu_{e^+}$, since the chemical potential of radiation is equal to zero.


The above consideration can be directly applied to matter in a homogeneous and 
isotropic expanding universe. If the interaction rate is much larger than the rate of 
expansion, the entropy of matter reaches its maximal value very quickly. In a ho- 
mogeneous universe there are no external sources of entropy, and therefore the total 
entropy of matter within a given comoving volume is conserved. If the interactions 
of some particles become inefficient, they decouple and evolve independently and 
their entropy is conserved separately. For example, after recombination photons 
propagate freely and they are not in thermal equilibrium. Nevertheless, they still 
have maximal possible entropy and hence satisfy the Bose-Einstein distribution
as if they were in equilibrium. A similar situation occurs for neutrinos when they 
decouple from matter. 

The simple arguments above are not valid when the universe becomes highly 
inhomogeneous as a result of gravitational instability. For this reason the initial 
state of the universe, which looks like a state of “thermal death” where nothing
could happen, can evolve to a state where very complicated structures, such as 
biological systems, occur. Nonequilibrium processes and gravitational instability 
will be considered later in detail and here we concentrate on the local equilibrium 
state. It is rather remarkable that in this state general arguments involving only the 
entropy and conservation laws are sufficient to describe the system completely and 
we do not need to use a kinetic theory or go into the details of quantum field theory. 


3.3.2 Energy density, pressure and the equation of state 


To calculate the energy density and pressure for a given distribution function $n_\epsilon$,
we have to determine $\Delta g_\epsilon$, the total number of possible microstates for a single
particle having energy within the interval from $\epsilon$ to $\epsilon + \Delta\epsilon$. Let us first consider a
particle with no internal degrees of freedom in one dimension. At any moment of
time its state can be specified completely by the coordinate $x$ and the momentum
$p$. In classical mechanics two infinitesimally different coordinates or momenta
correspond to microscopically different states. Therefore, the number of microstates
is infinite and the entropy can be defined only up to an infinite additive constant.
However, in quantum mechanics, two states within a cell of volume $2\pi\hbar$ in phase
space are not distinguishable because of the uncertainty relation. Hence, there is
only one possible microstate per corresponding phase volume. The generalization
for the case of a particle with $g$ internal degrees of freedom in three-dimensional
space is straightforward:

$$ \Delta g_\epsilon = g \int_\epsilon^{\epsilon+\Delta\epsilon} \frac{d^3x\, d^3p}{(2\pi\hbar)^3} = \frac{gV}{(2\pi\hbar)^3} \int_\epsilon^{\epsilon+\Delta\epsilon} d^3p, \tag{3.25} $$


where we have assumed homogeneity and integrated over the volume $V$. Henceforth,
we use natural units where $c = \hbar = k_B = G = 1$. The energy $\epsilon$ depends on
the momentum $|\mathbf{p}|$ and in the isotropic case we have

$$ \Delta g_\epsilon = \frac{gV}{2\pi^2} \int_\epsilon^{\epsilon+\Delta\epsilon} |\mathbf{p}|^2\, d|\mathbf{p}| \simeq \frac{gV}{2\pi^2} \sqrt{\epsilon^2 - m^2}\; \epsilon\, \Delta\epsilon, \tag{3.26} $$

where the relativistic relation,

$$ \epsilon^2 = |\mathbf{p}|^2 + m^2, $$

has been used. Note that the state with the minimal possible energy, $\epsilon = m$, drops
out when the approximate expression in (3.26) is used. This state becomes very
important when the chemical potential of the bosons approaches the mass of the
particles. In this case any new particles we add to the system occupy the minimal
energy state and form a Bose condensate.

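Taking the limit $\Delta\epsilon \to 0$ and considering a unit volume ($V = 1$), we obtain the
following expression for the particle number density:

$$ n = \sum_\epsilon n_\epsilon \Delta g_\epsilon = \frac{g}{2\pi^2} \int_m^\infty \frac{\sqrt{\epsilon^2 - m^2}}{\exp((\epsilon - \mu)/T) \mp 1}\, \epsilon\, d\epsilon, \tag{3.27} $$

where the minus sign applies to bosons and the plus to fermions. The energy density
is equal to

$$ \varepsilon = \sum_\epsilon \epsilon\, n_\epsilon \Delta g_\epsilon = \frac{g}{2\pi^2} \int_m^\infty \frac{\sqrt{\epsilon^2 - m^2}}{\exp((\epsilon - \mu)/T) \mp 1}\, \epsilon^2\, d\epsilon. \tag{3.28} $$

[Fig. 3.3]
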
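Let us now calculate the pressure. To do so we consider a small area element
$\Delta\sigma\, \mathbf{n}$, where $\mathbf{n}$ is the unit normal vector. All particles with velocity $|\mathbf{v}|$, striking this
area element in the time interval between $t$ and $t + \Delta t$, were located, at $t = 0$, in a
spherical shell of radius $R = |\mathbf{v}| t$ with width $|\mathbf{v}| \Delta t$ (Figure 3.3). The total number
of particles with energy $\epsilon(|\mathbf{v}|)$ within a solid angle $\Delta\Omega$ of this shell is equal to

$$ \Delta N = n_\epsilon \Delta g_\epsilon R^2 |\mathbf{v}| \Delta t\, \Delta\Omega, $$

where $\Delta g_\epsilon$ is the number of states per unit spatial volume. Not all particles in the
shell reach the target, only those with velocities directed to the area element. Taking
into account the isotropy of the velocity distribution, we find that the total number of
particles striking the area element $\Delta\sigma\, \mathbf{n}$ with velocity $\mathbf{v}$ is

$$ \Delta N_\sigma = \frac{(\mathbf{v}\cdot\mathbf{n})\Delta\sigma}{|\mathbf{v}|\, 4\pi R^2}\, \Delta N = \frac{(\mathbf{v}\cdot\mathbf{n})\Delta\sigma}{4\pi}\, n_\epsilon \Delta g_\epsilon\, \Delta t\, \Delta\Omega. $$

If these particles are reflected elastically, each transfers momentum $2(\mathbf{p}\cdot\mathbf{n})$ to the
target. Therefore, the contribution of particles with velocity $|\mathbf{v}|$ to the pressure is

$$ \Delta p = \int \frac{2(\mathbf{p}\cdot\mathbf{n})\, \Delta N_\sigma}{\Delta\sigma\, \Delta t} = \frac{|\mathbf{p}|^2}{2\pi\epsilon}\, n_\epsilon \Delta g_\epsilon \int \cos^2\theta\, \sin\theta\, d\theta\, d\varphi = \frac{|\mathbf{p}|^2}{3\epsilon}\, n_\epsilon \Delta g_\epsilon, $$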





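where we have used the relation $|\mathbf{v}| = |\mathbf{p}|/\epsilon$ and integrated over the hemisphere.
The total pressure is then

$$ p = \sum_\epsilon \frac{|\mathbf{p}|^2}{3\epsilon}\, n_\epsilon \Delta g_\epsilon = \frac{g}{6\pi^2} \int_m^\infty \frac{(\epsilon^2 - m^2)^{3/2}}{\exp((\epsilon - \mu)/T) \mp 1}\, d\epsilon = \frac{\varepsilon}{3} - \frac{m^2 g}{6\pi^2} \int_m^\infty \frac{\sqrt{\epsilon^2 - m^2}}{\exp((\epsilon - \mu)/T) \mp 1}\, d\epsilon. \tag{3.29} $$

Note that massless particles ($m = 0$) always have an ultra-relativistic equation of
state,

$$ p = \frac{\varepsilon}{3}, \tag{3.30} $$

independent of their spin and chemical potential.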
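To see how the equation of state interpolates between the ultra-relativistic value $p = \varepsilon/3$ and the nonrelativistic regime $p \ll \varepsilon$, the integrals (3.28) and (3.29) can be evaluated numerically. The following is a minimal sketch under stated assumptions: it uses the dimensionless variables $x = \epsilon/T$, $\alpha = m/T$, $\beta = \mu/T$, a hand-written Simpson rule, and the substitution $x = \alpha + v^2$ (chosen here only to smooth the square-root endpoint). All function names are hypothetical. For bosons (`sign=-1`) the sketch requires $\alpha > \beta$ so that the occupation number stays finite.

```python
import math

def simpson(f, a, b, n=2000):
    """Composite Simpson rule with an even number n of subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3.0

def pressure_to_energy_ratio(alpha, beta=0.0, sign=+1):
    """Ratio p / energy density from (3.28) and (3.29).

    alpha = m/T, beta = mu/T; sign=+1 for fermions, sign=-1 for bosons.
    """
    def occupation(x):
        return 1.0 / (math.exp(x - beta) + sign)

    def energy_integrand(v):
        # sqrt(x^2 - alpha^2) * x^2 dx with x = alpha + v^2
        x = alpha + v * v
        return 2.0 * v * v * math.sqrt(v * v + 2.0 * alpha) * x * x * occupation(x)

    def pressure_integrand(v):
        # (1/3) * (x^2 - alpha^2)^(3/2) dx with x = alpha + v^2
        x = alpha + v * v
        return 2.0 * v ** 4 * (v * v + 2.0 * alpha) ** 1.5 * occupation(x) / 3.0

    v_max = math.sqrt(60.0)  # the integrands are exponentially small beyond this
    return (simpson(pressure_integrand, 0.0, v_max)
            / simpson(energy_integrand, 0.0, v_max))

for alpha in (0.0, 0.01, 1.0, 20.0):
    print(alpha, pressure_to_energy_ratio(alpha))
```

For $\alpha = 0$ the two integrands coincide up to the factor $1/3$, so the ratio reproduces (3.30); for $\alpha \gg 1$ the ratio falls roughly as $T/m$.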


Problem 3.7 Substituting (3.20) into (3.16) and (3.22) into (3.21), verify that the
entropy density is

$$ s = \frac{\varepsilon + p - \mu n}{T}. \tag{3.31} $$

(Hint: Prove and then use the relation

$$ p = \pm T \sum_\epsilon \Delta g_\epsilon \ln(1 \pm n_\epsilon), \tag{3.32} $$

where the plus and minus signs apply to bosons and fermions respectively. It follows
that for $n_\epsilon \ll 1$ we have $p \approx nT$.)

Verify the following useful relations:

$$ n = \frac{\partial p}{\partial \mu}, \qquad s = \frac{\partial p}{\partial T}. \tag{3.33} $$


The above integrals over energy cannot be calculated exactly when both the mass
and chemical potential are different from zero. Therefore, we consider the limits
of high and low temperature and expand the integral in terms of small parameters. 
At temperatures much larger than the mass the calculation of the leading terms can 
be performed by simply neglecting the mass. However, it is not so easy to derive 
the subleading corrections. The problem is that these corrections are nonanalytic 
in both the mass and the chemical potential. Because the corresponding results are 
not readily available in the literature, we provide below a derivation of the high- 
temperature expansion. The reader who is not interested in these mathematical 
details can skip the next subsection and go directly to the final formulae. 


3.3.3 Calculating integrals 


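Changing the integration variable in (3.27), (3.28) and (3.29) from $\epsilon$ to $x = \epsilon/T$
and taking into account the fact that the chemical potentials of particles and
antiparticles are equal in magnitude and have opposite signs, the calculation of the
basic thermodynamical quantities reduces to computing the integrals

$$ J^{(\nu)}_\mp(\alpha, \beta) \equiv \int_\alpha^\infty \frac{(x^2 - \alpha^2)^{\nu/2}}{e^{x-\beta} \mp 1}\, dx + \int_\alpha^\infty \frac{(x^2 - \alpha^2)^{\nu/2}}{e^{x+\beta} \mp 1}\, dx, \tag{3.34} $$

where

$$ \alpha \equiv \frac{m}{T}, \qquad \beta \equiv \frac{\mu}{T}. $$

In particular, the total energy density of particles ($p$) and antiparticles ($\bar p$) is equal
to

$$ \varepsilon \equiv \varepsilon_p + \varepsilon_{\bar p} = \frac{g T^4}{2\pi^2} \left( J^{(3)}_\mp + \alpha^2 J^{(1)}_\mp \right), \tag{3.35} $$

and the total pressure is

$$ p \equiv p_p + p_{\bar p} = \frac{g T^4}{6\pi^2}\, J^{(3)}_\mp. \tag{3.36} $$

Problem 3.8 Verify that the excess of particles over antiparticles is given by

$$ n_p - n_{\bar p} = \frac{g T^3}{6\pi^2}\, \frac{\partial J^{(3)}_\mp}{\partial \beta}. \tag{3.37} $$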

To find the expansions for the integrals $J^{(1)}_\mp$ and $J^{(3)}_\mp$ in the limits of high and low
temperatures, we first calculate the auxiliary integral $J^{(-1)}_\mp$, which for $\beta < \alpha$ can
be written as a convergent infinite series of the modified Bessel functions $K_0$:

$$ J^{(-1)}_\mp = \sum_{n=1}^\infty (\pm 1)^{n+1} \left( e^{n\beta} + e^{-n\beta} \right) \int_\alpha^\infty \frac{e^{-nx}}{\sqrt{x^2 - \alpha^2}}\, dx = 2 \sum_{n=1}^\infty (\pm 1)^{n+1} \cosh(n\beta)\, K_0(n\alpha). \tag{3.38} $$

Then, given the expansion for $J^{(-1)}_\mp(\alpha, \beta)$, the functions $J^{(\nu)}_\mp(\alpha, \beta)$ can be obtained
by integrating the recurrence relation

$$ \frac{\partial J^{(\nu)}_\mp}{\partial \alpha} = -\nu \alpha\, J^{(\nu-2)}_\mp, \tag{3.39} $$

which follows immediately from the definition of $J^{(\nu)}_\mp$ in (3.34). Note that this
method works only for odd $\nu$. The “initial conditions” for (3.39) can be determined
by considering the limits $\alpha = 0$ or $\alpha \to \infty$, where the corresponding integrals can
easily be calculated.





High temperature expansion At temperatures much larger than the mass of the
particles, that is, for $\beta$ and $\alpha$ much smaller than unity, every term in the series
(3.38) contributes significantly. In this case we can use a known expansion for the
sum of modified Bessel functions: formula (8.526) in I. Gradshteyn, I. Ryzhik, Table
of Integrals, Series, and Products (San Diego: Academic Press, 1994). The result
for purely imaginary $\beta$ can be analytically continued to real $\beta$ and we obtain

$$ J^{(-1)}_- = \pi \left( \alpha^2 - \beta^2 \right)^{-1/2} + \ln(\alpha/4\pi) + C + O(\alpha^2, \beta^2), \qquad
   J^{(-1)}_+ = -\left( \ln(\alpha/\pi) + C \right) + O(\alpha^2, \beta^2), \tag{3.40} $$

for bosons and fermions respectively. Here $C \approx 0.577$ is Euler’s constant and
$O(\alpha^2, \beta^2)$ denotes terms which are quadratic and higher order in $\alpha$ and $\beta$.


Problem 3.9 Verify that the next subleading correction to (3.40) is

$$ \frac{7\zeta(3)}{8\pi^2} \left( \alpha^2 + 2\beta^2 \right), $$

where $\zeta$ is the Riemann zeta function.


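To determine $J^{(1)}_\mp$ and $J^{(3)}_\mp$ from (3.39) and (3.40), we need the “initial conditions”
$J^{(\nu)}_\mp(\alpha = 0, \beta)$. Setting $\alpha = 0$ and changing the integration variables, we can rewrite
the expression in (3.34) as

$$ J^{(\nu)}_\mp(0, \beta) = \int_0^\infty \frac{(y + \beta)^\nu + (y - \beta)^\nu}{e^y \mp 1}\, dy + \int_{-\beta}^0 \frac{(y + \beta)^\nu}{e^y \mp 1}\, dy - \int_0^\beta \frac{(y - \beta)^\nu}{e^y \mp 1}\, dy. \tag{3.41} $$

Replacing $y$ by $-y$ in the last integral and noting that

$$ \frac{1}{e^y \mp 1} + \frac{1}{e^{-y} \mp 1} = \mp 1, $$

we obtain for odd $\nu$

$$ J^{(\nu)}_\mp(0, \beta) = \int_0^\infty \frac{(y + \beta)^\nu + (y - \beta)^\nu}{e^y \mp 1}\, dy \mp \frac{\beta^{\nu+1}}{\nu + 1}. \tag{3.42} $$

It follows that

$$ J^{(1)}_-(0, \beta) = \frac{1}{3}\pi^2 - \frac{1}{2}\beta^2, \qquad J^{(1)}_+(0, \beta) = \frac{1}{6}\pi^2 + \frac{1}{2}\beta^2. \tag{3.43} $$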
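Substituting (3.40) into (3.39) and taking into account (3.43), one finds

$$ J^{(1)}_- = \frac{1}{3}\pi^2 - \frac{1}{2}\beta^2 - \pi\sqrt{\alpha^2 - \beta^2} - \frac{1}{2}\alpha^2 \left( \ln\frac{\alpha}{4\pi} + C - \frac{1}{2} \right) + \alpha^2 O, $$
$$ J^{(1)}_+ = \frac{1}{6}\pi^2 + \frac{1}{2}\beta^2 + \frac{1}{2}\alpha^2 \left( \ln\frac{\alpha}{\pi} + C - \frac{1}{2} \right) + \alpha^2 O, \tag{3.44} $$

where $O \equiv O(\alpha^2, \beta^2)$. Similarly we obtain

$$ J^{(3)}_- = \frac{2}{15}\pi^4 + \frac{1}{2}\pi^2 \left( 2\beta^2 - \alpha^2 \right) + \pi \left( \alpha^2 - \beta^2 \right)^{3/2} - A + \alpha^4 O, $$
$$ J^{(3)}_+ = \frac{7}{60}\pi^4 + \frac{1}{4}\pi^2 \left( 2\beta^2 - \alpha^2 \right) + A - \frac{3}{4}(\ln 2)\, \alpha^4 + \alpha^4 O, \tag{3.45} $$

where

$$ A \equiv \frac{1}{8} \left[ 2\beta^4 - 6\alpha^2\beta^2 - 3\alpha^4 \ln\left( \frac{\alpha\, e^C}{4\pi\, e^{3/4}} \right) \right]. $$
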


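Low-temperature expansion In the limit of small temperatures we have $\alpha =
m/T \gg 1$ and $K_0(n\alpha) \propto \exp(-n\alpha)$. Therefore, for $\alpha - \beta \gg 1$, all terms on the
right hand side in (3.38) are negligible compared to the first term:

$$ J^{(-1)}_\mp \simeq 2 K_0(\alpha) \cosh\beta. \tag{3.46} $$

Integrating (3.39) and taking into account that $J^{(\nu)}_\mp$ must vanish as $\alpha \to \infty$, we
obtain

$$ J^{(1)}_\mp \simeq 2\alpha K_1(\alpha) \cosh\beta \simeq \sqrt{2\pi\alpha}\; e^{-\alpha} \cosh\beta \left( 1 + \frac{3}{8\alpha} + O(\alpha^{-2}) \right) \tag{3.47} $$

and

$$ J^{(3)}_\mp \simeq 6 \left( \alpha^2 K_0(\alpha) + 2\alpha K_1(\alpha) \right) \cosh\beta \simeq \sqrt{18\pi\alpha^3}\; e^{-\alpha} \cosh\beta \left( 1 + \frac{15}{8\alpha} + O(\alpha^{-2}) \right). \tag{3.48} $$
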

These formulae allow us to calculate the basic thermodynamic properties of nonrelativistic
particles when $\alpha - \beta \gg 1$. In such cases the exponential term dominates
the denominator of the integrand in (3.34) and the difference between Fermi and 
Bose statistics becomes insignificant because the occupation numbers are much 
less than unity. We will see in the next section that this case is the situation most 
relevant for cosmological applications. 
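The low-temperature formula (3.47) can be compared with a direct numerical evaluation of the defining integral (3.34) for $\nu = 1$. The sketch below is only an illustration under stated assumptions: the Simpson rule, the substitution $x = \alpha + v^2$ and the test values $\alpha = 10$, $\beta = 2$ are choices made here, and the function names are hypothetical.

```python
import math

def simpson(f, a, b, n=2000):
    """Composite Simpson rule with an even number n of subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3.0

def J1_numeric(alpha, beta, sign):
    """J^(1) from (3.34); sign=-1 for bosons, sign=+1 for fermions. Needs alpha > beta."""
    def integrand(v):
        x = alpha + v * v
        # (x^2 - alpha^2)^(1/2) dx = 2 v^2 sqrt(v^2 + 2 alpha) dv
        weight = 2.0 * v * v * math.sqrt(v * v + 2.0 * alpha)
        particles = 1.0 / (math.exp(x - beta) + sign)
        antiparticles = 1.0 / (math.exp(x + beta) + sign)
        return weight * (particles + antiparticles)
    return simpson(integrand, 0.0, math.sqrt(60.0))

def J1_low_temperature(alpha, beta):
    """Asymptotic form (3.47), truncated after the 3/(8 alpha) term."""
    return (math.sqrt(2.0 * math.pi * alpha) * math.exp(-alpha)
            * math.cosh(beta) * (1.0 + 3.0 / (8.0 * alpha)))

alpha, beta = 10.0, 2.0
approx = J1_low_temperature(alpha, beta)
for label, sign in (("bosons", -1), ("fermions", +1)):
    exact = J1_numeric(alpha, beta, sign)
    print(label, exact, approx, abs(exact / approx - 1.0))
```

The boson and fermion values differ from each other and from (3.47) only at the level expected from the neglected $O(\alpha^{-2})$ term and the $n \ge 2$ terms of the series (3.38).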


3.3.4 Ultra-relativistic particles 


Bosons For bosons the chemical potential cannot exceed the mass, $\mu_b \le m$.
Assuming that both $\alpha$ and $\beta$ are much smaller than unity and
substituting (3.45) into (3.37), we find that at high temperatures the excess of
particles over antiparticles to leading order is

$$ n_b - \bar n_b \simeq \frac{g T^3}{3} \left( \frac{\mu_b}{T} \right). \tag{3.49} $$

To estimate the number density of ultra-relativistic bosons we set $m = \mu_b = 0$ in
(3.27) and then obtain

$$ n_b \simeq \frac{\zeta(3)}{\pi^2}\, g T^3, \tag{3.50} $$

where $\zeta(3) \approx 1.202$. From (3.49) and (3.50) one might be tempted to conclude
that at high temperatures the excess of particles over antiparticles is always small
compared to the number density of the particles themselves. This conclusion is
wrong, however. The expression in (3.49) is applicable only if $\mu_b < m$. As $\mu_b \to m$,
new particles added to the system fill the minimal energy state $\epsilon = m$, which is
not taken into account in (3.49). These particles form a Bose condensate which can
have an arbitrarily large particle excess.


Problem 3.10 Given a particle excess per unit volume $\Delta n$, find the temperature $T_B$
below which the Bose condensate forms. Assume that $T_B \gg m$ and determine when
this condition is actually satisfied. How much does a Bose condensate contribute
to the total energy density, pressure and entropy?


If no Bose condensate is formed, the excess of bosons over antibosons is small
compared to the number density. In this case the energy densities of particles and
antiparticles are nearly equal and it follows from (3.35) and (3.45) that

$$ \varepsilon_b \simeq \bar\varepsilon_b \simeq \frac{\pi^2}{30}\, g T^4. \tag{3.51} $$

The pressure and the entropy density are

$$ p_b \simeq \frac{\varepsilon_b}{3}, \qquad s_b \simeq \frac{4}{3} \frac{\varepsilon_b}{T} = \frac{2\pi^4}{45\,\zeta(3)}\, n_b, \tag{3.52} $$

respectively. For massless bosons the chemical potential should be equal to zero.
In this case (for example, for photons) all equations above are exact.


Fermions The chemical potential for fermions can be arbitrarily large and can
exceed the mass. We first derive the exact formulae for an arbitrary $\mu_f$ in the limit
of vanishing mass. Taking $\alpha \to 0$ in (3.44) and (3.45) and substituting the result
into (3.35), we obtain

$$ \varepsilon_f + \bar\varepsilon_f = 3\left( p_f + \bar p_f \right) = \frac{7\pi^2}{120}\, g T^4 \left[ 1 + \frac{30\beta^2}{7\pi^2} + \frac{15\beta^4}{7\pi^4} \right], \tag{3.53} $$

where $\beta = \mu_f/T$. The pressure is equal to one third of the energy density as
expected for massless particles. It follows from (3.37) that the excess of fermions
over antifermions is

$$ n_f - \bar n_f = \frac{g T^3}{6}\, \beta \left[ 1 + \frac{\beta^2}{\pi^2} \right]. \tag{3.54} $$

Substituting the expressions above into (3.31) for the entropy density, we obtain

$$ s_f + \bar s_f = \frac{7\pi^2}{90}\, g T^3 \left[ 1 + \frac{15\beta^2}{7\pi^2} \right]. \tag{3.55} $$
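Since the massless-fermion results are stated to be exact for arbitrary $\beta = \mu_f/T$, they lend themselves to a direct numerical check. The sketch below integrates the $m = 0$ forms of (3.27) and (3.28) for particles plus or minus antiparticles and compares them with the closed forms implied by (3.53) and (3.54), namely $\int_0^\infty x^2 [f(x-\beta) - f(x+\beta)]\,dx = (\pi^2\beta + \beta^3)/3$ and $\int_0^\infty x^3 [f(x-\beta) + f(x+\beta)]\,dx = 7\pi^4/60 + \pi^2\beta^2/2 + \beta^4/4$ with $f(y) = 1/(e^y + 1)$. The function names, the cutoff `x_max` and the value $\beta = 3$ are illustrative assumptions.

```python
import math

def simpson(f, a, b, n=4000):
    """Composite Simpson rule with an even number n of subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3.0

def fermi(y):
    """Fermi occupation 1/(e^y + 1), written to avoid overflow for large |y|."""
    if y > 0:
        e = math.exp(-y)
        return e / (1.0 + e)
    return 1.0 / (1.0 + math.exp(y))

def excess_integral(beta, x_max=80.0):
    """Integral entering n_f - nbar_f for m = 0 (in units of g T^3 / (2 pi^2))."""
    return simpson(lambda x: x * x * (fermi(x - beta) - fermi(x + beta)), 0.0, x_max)

def energy_integral(beta, x_max=80.0):
    """Integral entering eps_f + epsbar_f for m = 0 (in units of g T^4 / (2 pi^2))."""
    return simpson(lambda x: x ** 3 * (fermi(x - beta) + fermi(x + beta)), 0.0, x_max)

beta = 3.0
excess_closed = (math.pi ** 2 * beta + beta ** 3) / 3.0
energy_closed = 7.0 * math.pi ** 4 / 60.0 + math.pi ** 2 * beta ** 2 / 2.0 + beta ** 4 / 4.0

print(excess_integral(beta), excess_closed)
print(energy_integral(beta), energy_closed)
```

The agreement holds for any $\beta$, not only $\beta \ll 1$, which is the sense in which (3.53) and (3.54) are exact for massless fermions.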


If the chemical potential is much larger than the temperature, the main contribution
to the total energy density comes from the degenerate fermions and is equal to
$g\mu_f^4/8\pi^2$. These fermions fill the states with energies smaller than the Fermi energy
$\epsilon_F = \mu_f$, which determines the Fermi surface. The temperature correction to the
energy, which to leading order is of order $g T^2 \mu_f^2/4$, is due to the particles located
in the shell of width $T$ near this Fermi surface. One can see from (3.55) that the
only states which contribute to the entropy are those near the Fermi surface. As
the temperature approaches zero, the entropy vanishes. In this limit all fermions
occupy definite states and information about the system is complete. Note that the
antiparticles, for which $\mu_{\bar f} < 0$, disappear as the temperature vanishes.

If $\beta \ll 1$, then

$$ n_f - \bar n_f \simeq \frac{g T^3}{6}\, \beta, \tag{3.56} $$

and the excess of fermions over antifermions is small compared to the number
density. In this case we can neglect the chemical potential in (3.27) and, to leading
order, the number densities of fermions and antifermions are the same, namely,

$$ n_f \simeq \bar n_f \simeq \frac{3\zeta(3)}{4\pi^2}\, g T^3. \tag{3.57} $$

The energy density, pressure and entropy density of the fermions are

$$ \varepsilon_f = \frac{7\pi^2}{240}\, g T^4, \qquad p_f = \frac{\varepsilon_f}{3}, \qquad s_f = \frac{4}{3} \frac{\varepsilon_f}{T}, \tag{3.58} $$

respectively. If the mass is small compared with the temperature but nonzero, there
exist mass corrections, as can be inferred from the formulae derived in the previous
subsection. They are nonanalytic in $\alpha = m/T$ and if $\beta = \mu/T \neq 0$, cross-terms
simultaneously containing mass and chemical potential are also present.

Finally, we note the useful relation between the entropy of ultra-relativistic
fermions with a small chemical potential and the entropy of ultra-relativistic bosons
when the two types of particles have the same number of internal degrees of freedom:

$$ s_f = \frac{7}{8}\, s_b. \tag{3.59} $$


3.3.5 Nonrelativistic particles

If the temperature is smaller than the rest mass and in addition

$$ \frac{m - \mu}{T} \gg 1, $$

spin-statistics do not play an essential role and the formulae for bosons and fermions
coincide to leading order. Substituting (3.48) into (3.37), we find that in this case

$$ n - \bar n \simeq 2g \left( \frac{Tm}{2\pi} \right)^{3/2} \exp\left( -\frac{m}{T} \right) \sinh\left( \frac{\mu}{T} \right) \left[ 1 + \frac{15}{8} \frac{T}{m} + O\left( \frac{T^2}{m^2} \right) \right]. \tag{3.60} $$

It follows that the number density of particles is

$$ n \simeq g \left( \frac{Tm}{2\pi} \right)^{3/2} \exp\left( -\frac{m - \mu}{T} \right) \left( 1 + \frac{15}{8} \frac{T}{m} \right), \tag{3.61} $$

and the number density of antiparticles, $\bar n$, is suppressed by a factor of $\exp(-2\mu/T)$
compared to $n$; if $\mu/T \gg 1$ the antiparticles can be neglected. In the early
universe the number density of any type of nonrelativistic species never exceeds
the number density of photons, that is, $n \ll n_\gamma \sim T^3$, and hence the inequality
$(m - \mu)/T \gg 1$ is fulfilled. The energy density of particles is obtained by
substituting (3.47) and (3.48) into (3.35), and can be expressed in terms of the particle
number density as

$$ \varepsilon \simeq m n + \frac{3}{2}\, n T. \tag{3.62} $$

The pressure $p \approx nT$ is much smaller than the energy density and can be neglected
in the Einstein equations. The entropy density of the nonrelativistic particles can
easily be calculated from (3.31) and is equal to

$$ s \simeq \left( \frac{m - \mu}{T} + \frac{5}{2} \right) n. \tag{3.63} $$




Problem 3.11 If $m/T \gg 1$ but $|m - \mu|/T \ll 1$, one cannot ignore spin-statistics.
In this limit, however, the antiparticles are suppressed by a factor of $\exp(-2m/T)$
and hence can be neglected. Calculate the corresponding energy density, pressure
and entropy for bosons and fermions in this case. Given a number density $n$, verify
that at temperatures below $T_B = O(1)\, n^{2/3}/m$ a Bose condensate is formed.


The chemical potential of fermions can be arbitrarily large and may significantly
exceed the mass. If $(\mu_f - m)/T \gg 1$, most fermions are degenerate.
When $\mu_f \gg m$, fermions near the Fermi surface have momenta of order $\mu_f$
and are therefore relativistic, so we can use the results in (3.53)-(3.55). Otherwise,
if $(\mu_f - m) \ll m$, the gas of degenerate fermions is nonrelativistic and the
corresponding formulae are the standard ones found in any book on statistical
physics. Having completed our brief review of relativistic statistical mechanics, we
now apply the results derived to the early universe.


3.4 Lepton era 


When the temperature in the universe drops below a hundred MeV (at $t > 10^{-4}$ s),
the quarks and gluons are confined and form color-singlet bound states: baryons
and mesons. We recall that baryons are made out of three quarks, each of which
has baryon number 1/3, while the mesons are bound states of one quark and one
antiquark, so that their resulting baryon number is zero.

The main ingredients of ordinary matter at temperatures below 100 MeV are
primordial radiation ($\gamma$), neutrons ($n$), protons ($p$), electrons and positrons ($e^-$, $e^+$),
and three neutrino species. Mesons, heavy baryons, $\mu$- and $\tau$-leptons are also
present, but their number densities are very small and become increasingly
negligible as the temperature decreases.

At energies of order a few MeV, the most important processes involve the weak 
interactions in which leptons, such as neutrinos, participate. Therefore, one calls 
this epoch the lepton era. At low energies, the baryon number and the lepton 
numbers are each conserved. The total electric charge is obviously also conserved. 
To enforce these conservation laws, a chemical potential is introduced for each 
particle species. The number of the independent potentials, however, is equal to the 
number of conserved quantities; any remaining potentials are expressed through 
these independent potentials using the chemical equilibrium conditions (3.24). 

To demonstrate this, let us consider a medium containing the following ingredients:
photons, leptons $e$, $\mu$, $\tau$, neutrinos $\nu_e$, $\nu_\mu$, $\nu_\tau$, the lightest baryons $p$, $n$, $\Lambda$,
and mesons $\pi^0$, $\pi^\pm$. The corresponding antiparticles are also present in the state of
equilibrium. To enforce the conservation laws for electric charge, baryon number
and the three different lepton numbers, we take as independent the following five
chemical potentials: $\mu_{e^-}$, $\mu_n$, $\mu_{\nu_e}$, $\mu_{\nu_\mu}$, $\mu_{\nu_\tau}$. All other potentials will be written in
terms of the members of this set. To start with,

$$ \mu_{\pi^0} = 0 \tag{3.64} $$

because, as a result of electromagnetic interaction, the $\pi^0$ meson quickly decays
($\tau_{\pi^0} \simeq 8.7 \times 10^{-17}$ s) into photons ($\pi^0 \to \gamma\gamma$) which have $\mu_\gamma = 0$. From $\Lambda \rightleftharpoons
n\pi^0$ ($\tau_\Lambda \simeq 2.6 \times 10^{-10}$ s), we find

$$ \mu_\Lambda = \mu_n + \mu_{\pi^0} = \mu_n. \tag{3.65} $$

The muon is unstable ($\tau_\mu \simeq 2.2 \times 10^{-6}$ s) and decays into an electron, an
antineutrino and a neutrino, $\mu^- \to e^- \bar\nu_e \nu_\mu$, and hence

$$ \mu_\mu = \mu_{e^-} - \mu_{\nu_e} + \mu_{\nu_\mu}. \tag{3.66} $$

The $\tau$-lepton also decays, for example into $e^-$, $\bar\nu_e$, $\nu_\tau$, therefore

$$ \mu_\tau = \mu_{e^-} - \mu_{\nu_e} + \mu_{\nu_\tau}. \tag{3.67} $$

Finally, from the reactions $\pi^- \to \bar\nu_\mu \mu^-$ and $p e^- \rightleftharpoons n \nu_e$, we deduce that

$$ \mu_{\pi^-} = \mu_{e^-} - \mu_{\nu_e} \tag{3.68} $$

and

$$ \mu_p = \mu_n + \mu_{\nu_e} - \mu_{e^-}. \tag{3.69} $$

All other possible reactions lead to relations that are consistent with the ones above.
We recall that the chemical potentials for antiparticles are equal in magnitude to
the chemical potentials for their corresponding particles but have opposite sign.
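The bookkeeping behind (3.64)-(3.69) is linear algebra over the five independent potentials, and it can be mirrored in a few lines of code. The sketch below represents each chemical potential as a dictionary of integer coefficients in the basis $\mu_{e^-}$, $\mu_n$, $\mu_{\nu_e}$, $\mu_{\nu_\mu}$, $\mu_{\nu_\tau}$; the representation, the helper `combine` and the string labels are illustrative choices, not notation from the text. Antiparticles enter with a minus sign.

```python
def combine(*terms):
    """Sum of (sign, potential) pairs; a potential is a dict basis-label -> coefficient."""
    total = {}
    for sign, potential in terms:
        for label, coefficient in potential.items():
            total[label] = total.get(label, 0) + sign * coefficient
    # Drop vanishing coefficients so that equal potentials compare equal.
    return {label: c for label, c in total.items() if c != 0}

# Five independent potentials, one per conserved quantity.
mu = {label: {label: 1} for label in ("e-", "n", "nu_e", "nu_mu", "nu_tau")}

mu["gamma"] = {}                                                          # photons
mu["pi0"] = combine((1, mu["gamma"]), (1, mu["gamma"]))                   # (3.64)
mu["Lambda"] = combine((1, mu["n"]), (1, mu["pi0"]))                      # (3.65)
mu["mu-"] = combine((1, mu["e-"]), (-1, mu["nu_e"]), (1, mu["nu_mu"]))    # (3.66)
mu["tau-"] = combine((1, mu["e-"]), (-1, mu["nu_e"]), (1, mu["nu_tau"]))  # (3.67)
mu["pi-"] = combine((-1, mu["nu_mu"]), (1, mu["mu-"]))                    # pi- -> anti-nu_mu + mu-
mu["p"] = combine((1, mu["n"]), (1, mu["nu_e"]), (-1, mu["e-"]))          # (3.69)

# A reaction not used above, n -> p + e- + anti-nu_e, must give a consistent relation.
neutron_decay_rhs = combine((1, mu["p"]), (1, mu["e-"]), (-1, mu["nu_e"]))

print("mu_pi- =", mu["pi-"])
print("mu_p + mu_e- - mu_nu_e =", neutron_decay_rhs)
```

The computed `mu["pi-"]` reproduces (3.68), and the neutron-decay combination reduces to $\mu_n$, illustrating the statement that all other reactions give relations consistent with the ones listed.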
The five independent chemical potentials can be expressed through the five
conserved quantities. The conservation of the total baryon number means that the
number density of baryons minus antibaryons decreases in inverse proportion to
the third power of the scale factor $a$. If matter is in equilibrium, the total entropy
is conserved and, as the universe expands, the entropy density $s$ also scales as $a^{-3}$.
Therefore, the baryon-to-entropy ratio

$$ B \equiv \frac{\Delta n_p + \Delta n_n + \Delta n_\Lambda}{s} \tag{3.70} $$

remains constant. We denote here by $\Delta n \equiv n - \bar n$ the excess of the corresponding
particles over their antiparticles. Similarly, the conservation law for total electric
charge can be written as

$$ Q \equiv \frac{\Delta n_p - \Delta n_e - \Delta n_\mu - \Delta n_\tau - \Delta n_{\pi^-}}{s} = \text{const}, \tag{3.71} $$

and for each type of lepton number we have

$$ L_i \equiv \frac{\Delta n_i + \Delta n_{\nu_i}}{s} = \text{const}, \tag{3.72} $$

where $i \equiv e$, $\mu$ or $\tau$. Because all $\Delta n$ can be expressed through the temperature $T$
and the corresponding chemical potentials, the system of equations (3.70), (3.71)
and (3.72), together with the conservation law for the total entropy,

$$ \frac{d(s a^3)}{dt} = 0, \tag{3.73} $$

allows us to determine the six unknown functions of time: $T(t)$, $\mu_{e^-}(t)$, $\mu_n(t)$, $\mu_{\nu_e}(t)$,
$\mu_{\nu_\mu}(t)$, $\mu_{\nu_\tau}(t)$.

What is known about the numerical values of $B$, $Q$ and $L_i$? The universe appears
to be electrically neutral and hence $Q = 0$. The baryon-to-entropy ratio is rather well
established from observations and is of order $B \simeq 10^{-10}$-$10^{-9}$. This means that
the entropy per baryon or, equivalently, the number of photons ($n_\gamma \sim s \sim T^3$)
per baryon, is very large, $\sim 10^{9}$-$10^{10}$. The lepton-to-entropy ratios $L_i$ are not so
well established. The most severe limits on $L_i$ are indirect. We will see in the next
chapter that the total fermion number is not conserved at temperatures higher than
100 GeV and, as a result, the combination

$$ B + a\left( L_e + L_\mu + L_\tau \right), $$

where $a \sim O(1)$, vanishes. Hence, if there are no special cancellations between the
lepton numbers, their absolute values cannot significantly exceed the baryon number,
that is, $|L_i| < 10^{-9}$. Limits from more direct observations are much weaker.

If the temperature is higher than the mass of a particular particle, the particle
is relativistic and many particle-antiparticle pairs are created from the vacuum,
so that the number density of pairs is of order the number density of photons,
$n_\gamma \sim T^3$. As the temperature drops below the mass, most of these pairs annihilate
and finally only the particle excess survives. Let us determine when the numbers of
particle-antiparticle pairs become negligible. The particle excess is characterized
by a constant number

$$ \beta \equiv \frac{n - \bar n}{s}, \tag{3.74} $$

which can be either a baryon number, a lepton number or electric charge. Solving
this together with the equation

$$ \frac{n \bar n}{s^2} \sim \left( \frac{m}{T} \right)^3 \exp\left( -\frac{2m}{T} \right), \tag{3.75} $$

which follows from (3.61), we obtain

$$ \frac{n}{s} \sim \frac{\beta}{2} + \sqrt{ \frac{\beta^2}{4} + \left( \frac{m}{T} \right)^3 \exp\left( -\frac{2m}{T} \right) }, \qquad
   \frac{\bar n}{s} \sim -\frac{\beta}{2} + \sqrt{ \frac{\beta^2}{4} + \left( \frac{m}{T} \right)^3 \exp\left( -\frac{2m}{T} \right) }. \tag{3.76} $$

It is clear that the number density of particle-antiparticle pairs becomes negligible
compared to the particle excess when the second term under the square root becomes
smaller than the first. For $\beta \ll 1$ this occurs at

$$ \frac{m}{T} > \ln\left( \frac{2}{\beta} \right) + \frac{3}{2} \ln\left( \ln\left( \frac{2}{\beta} \right) + \cdots \right). \tag{3.77} $$

For example, if $\beta \simeq 10^{-9}$, the particle-antiparticle pairs can be neglected when
the temperature drops by a factor of 25 below the mass. Thus, the number of
baryon-antibaryon pairs becomes small compared to the baryon excess at
temperatures below 40 MeV while positrons can be neglected at $T < 20$ keV.
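The threshold in (3.77) is the solution of a transcendental equation, so it is easy to cross-check numerically. The sketch below solves $(m/T)^3 \exp(-2m/T) = \beta^2/4$ by bisection, that is, it finds the point where the two terms under the square root in (3.76) are equal. It inherits the assumption made in (3.75) that $O(1)$ prefactors are dropped, so the result is an estimate rather than a precise value; the function name, the bracketing interval and the sample masses (proton and electron, in MeV) are choices made here.

```python
import math

def pair_threshold(beta):
    """Solve (m/T)^3 exp(-2 m/T) = beta^2 / 4 for x = m/T on the branch x > 3/2."""
    target = math.log(beta * beta / 4.0)

    def g(x):
        # log of the left-hand side minus log of the right-hand side;
        # decreasing for x > 3/2, so the root in the bracket is unique.
        return 3.0 * math.log(x) - 2.0 * x - target

    lo, hi = 1.5, 1000.0
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if g(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

beta = 1e-9
x = pair_threshold(beta)
leading = math.log(2.0 / beta) + 1.5 * math.log(math.log(2.0 / beta))

print("m/T at threshold:", x)
print("first iteration of (3.77):", leading)
print("baryon pairs negligible below ~", 938.3 / x, "MeV")
print("positrons negligible below ~", 0.511 / x * 1e3, "keV")
```

For $\beta = 10^{-9}$ the root lies a little above 26, in line with the "factor of 25" quoted in the text and with the temperatures of a few tens of MeV for baryons and about 20 keV for positrons.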

At low temperatures, the conserved charge is mostly carried by the lightest
particles possessing the given charge. For example, taking into account that
$\mu_\Lambda = \mu_n$, from (3.60) we obtain

$$ \frac{\Delta n_\Lambda}{\Delta n_n} \simeq \left( \frac{m_\Lambda}{m_n} \right)^{3/2} \exp\left( -\frac{m_\Lambda - m_n}{T} \right) \simeq \exp\left( -\frac{176\ \text{MeV}}{T} \right). \tag{3.78} $$

Thus, at $T < 176$ MeV, the contribution of $\Lambda$ particles to the total baryon number
can be discarded and the baryon asymmetry is due to the lightest baryons, protons
and neutrons. Similarly, at temperatures below 100 MeV, the electric charge excess
carried by leptons and mesons is mostly due to the overabundance of electrons, since
$\mu$- and $\tau$-leptons and the lightest charged mesons have relatively large masses,
namely, $m_\mu \simeq 106$ MeV, $m_\tau \simeq 1.78$ GeV and $m_{\pi^\pm} \simeq 140$ MeV.


3.4.1 Chemical potentials 


At temperatures higher than a few MeV, the weak and electromagnetic interactions 
are efficient and baryons, leptons and photons are in local thermal and chemical 
equilibrium. Note that in general, thermal and chemical equilibria are distinct. For 
example, while strong and electromagnetic interactions keep neutrons, protons and 
radiation at the same temperature, if the weak interaction rate is smaller than the 
expansion rate the chemical potentials of protons and neutrons do not need to satisfy 
a chemical equilibrium condition. 

At temperatures below 100 MeV, we can neglect all heavy baryons and leptons.
Let us estimate the chemical potentials of various matter components at these
temperatures, beginning with neutrinos. Assuming that the lepton numbers $L_i$ are
much smaller than unity, we find from (3.72) and (3.54) that

$$ \frac{\mu_{\nu_{\mu,\tau}}}{T} \sim L_{\mu,\tau}, \tag{3.79} $$

where the entropy density is estimated as $s \sim T^3$ and we have taken into account
that the main contribution to $L_{\mu,\tau}$ comes from $\nu_{\mu,\tau}$ because the $\tau$- and $\mu$-leptons
have large masses. The electrons are the lightest leptons which carry the electric
charge needed to compensate the electric charge of the baryons. Therefore, their
contribution to $L_e$ is not negligible and the estimate, analogous to (3.79), applies to
the sum $\mu_e + \mu_{\nu_e}$ rather than to the chemical potential of electron neutrinos alone.
We see that the chemical potentials of relativistic particles decrease in proportion
to the temperature as the universe expands.

As we found, at $T < 40$ MeV, antibaryons can be neglected and, therefore,
$\Delta n_{p,n} \approx n_{p,n}$. The conservation law of the total baryonic charge (3.70) implies that


$$ B \simeq \frac{n_p}{s} \left( 1 + \frac{n_n}{n_p} \right) \tag{3.80} $$

remains constant. The factor inside the parentheses is of order unity. Using formula
(3.61) for $n_p$, we obtain

$$ \frac{m_p - \mu_p(t)}{T} \sim \ln\left( \frac{1}{B} \left( \frac{m_p}{T} \right)^{3/2} \right). \tag{3.81} $$

For $B \simeq 10^{-10}$, the chemical potential $\mu_p$ changes from about −115 MeV to
+967 MeV as the temperature drops from 40 MeV to 1 MeV. The number density
of protons decays as $T^3 \propto a^{-3}$. Substituting (3.81) into (3.63), we obtain the
following estimate for the contribution of protons to the total entropy,

$$ \frac{s_p}{s} = \left( \frac{m_p - \mu_p(t)}{T(t)} + \frac{5}{2} \right) \frac{n_p}{s} \sim B \ln\left( \frac{1}{B} \left( \frac{m_p}{T} \right)^{3/2} \right). \tag{3.82} $$

Thus, nonrelativistic protons contribute only a small fraction of order $10^{-8}$ to the
total entropy. It is interesting to note that the total entropy of protons themselves is
not conserved. It follows from (3.82) that this entropy logarithmically increases as
the temperature decreases. This has a simple physical explanation. If nonrelativistic
protons were completely decoupled from the other components, their temperature
would decrease faster than the temperature of the relativistic particles, namely, as
$1/a^2$ instead of $1/a$. Therefore, to maintain thermal equilibrium with the dominant
relativistic components, the protons borrow the energy and entropy needed from
them.


Problem 3.12 Verify that the conservation of entropy and of the total number of 
nonrelativistic particles implies that their temperature decreases in inverse propor- 
tion to the second power of the scale factor. How does the chemical potential depend 
on the temperature in this case? 


To estimate the chemical potential of electrons, $\mu_e$, we use the conservation law
for the electric charge (see (3.71)). Because the universe is electrically neutral, we
have $Q = 0$. Taking into account that electrons are still relativistic at $T > 1$ MeV
and skipping the negligible contributions from $\tau$- and $\mu$-leptons and $\pi^-$ mesons in
(3.71), we find

$$ \frac{\mu_e}{T} \sim \frac{\Delta n_e}{s} \simeq \frac{\Delta n_p}{s} \sim B \sim 10^{-10}. \tag{3.83} $$

This is not surprising because we need only a small excess of electrons to
compensate the electric charge of the protons.

Finally, let us estimate the ratio of the number densities of neutrons and protons
when they are still in chemical equilibrium with each other and with leptons. At the
beginning of this section, we found that chemical equilibrium implies $\mu_p - \mu_n =
\mu_{\nu_e} - \mu_e$. Using this relation together with (3.61), one immediately obtains

$$ \frac{n_n}{n_p} = \exp\left( -\frac{m_n - m_p}{T} - \frac{\mu_{\nu_e} - \mu_e}{T} \right) \simeq \exp\left( -\frac{Q}{T} \right), \tag{3.84} $$

where $Q \equiv m_n - m_p \simeq 1.293$ MeV and we have neglected $\mu_e$ and $\mu_{\nu_e}$ in the latter
equality. The relation above will be used to set up the initial conditions for primordial
nucleosynthesis.


3.4.2 Neutrino decoupling and electron—positron annihilation 


At early times, the main contribution to the energy density comes from relativistic
particles. Neglecting the chemical potentials, (3.51) and (3.53) imply that their total
energy density is

$$\varepsilon_r = \kappa T^4, \qquad (3.85)$$

where

$$\kappa = \frac{\pi^2}{30}\left(g_b + \frac{7}{8}\, g_f\right) \qquad (3.86)$$

and $g_b$ and $g_f$ are the total numbers of internal degrees of freedom of all relativistic
bosons and fermions respectively. Let us calculate $\kappa$ in the universe when the
only relativistic particles in equilibrium are photons, electrons, the three neutrino
species and their corresponding antiparticles. Photons have two polarizations, and
so $g_b = 2$. Electrons have two internal degrees of freedom, but each type of neutrino
has only one because neutrinos are left-handed. The antiparticles double the total
number of fermionic degrees of freedom and, therefore, $g_f = 10$. Thus, in this
case, $\kappa \simeq 3.537$. Every extra bosonic or fermionic degree of freedom changes $\kappa$ by
$\Delta\kappa_b \simeq 0.329$ or $\Delta\kappa_f \simeq 0.288$ respectively.
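
The counting of degrees of freedom above is easy to check numerically. The following is a minimal sketch of (3.86); the function name `kappa` and its argument names are illustrative choices, not notation from the text.

```python
import math

def kappa(g_bosons: float, g_fermions: float) -> float:
    """Coefficient kappa in eps = kappa * T**4, following (3.86)."""
    return math.pi**2 / 30.0 * (g_bosons + 7.0 / 8.0 * g_fermions)

# Photons: 2 polarizations. Fermions: e- and e+ (2 each) plus three
# neutrinos and three antineutrinos (1 each), i.e. 4 + 6 = 10.
kappa_before_annihilation = kappa(2, 10)
delta_kappa_boson = kappa(1, 0)      # one extra bosonic degree of freedom
delta_kappa_fermion = kappa(0, 1)    # one extra fermionic degree of freedom

print(round(kappa_before_annihilation, 3),
      round(delta_kappa_boson, 3),
      round(delta_kappa_fermion, 3))
```

The three printed numbers should reproduce the values 3.537, 0.329 and 0.288 quoted above.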

Comparing (3.85) to (1.75), we find the relation between the temperature and
the cosmological time in a flat, radiation-dominated universe:

$$t = \left(\frac{3}{32\pi\kappa}\right)^{1/2} \frac{1}{T^2}. \qquad (3.87)$$

Converting from Planckian units, we can rewrite this relation in the following useful
form:

$$t_{\rm sec} = t_{Pl}\left(\frac{3}{32\pi\kappa}\right)^{1/2}\left(\frac{T_{Pl}}{T}\right)^2 \simeq 1.39\,\kappa^{-1/2}\,\frac{1}{T_{\rm MeV}^2}, \qquad (3.88)$$

where the cosmological time and temperature are measured in seconds and MeV
respectively.
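
As a rough numerical check of the coefficient in (3.88), one can insert the Planck time and Planck energy directly. The sketch below assumes the values $t_{Pl} \approx 5.391 \times 10^{-44}$ s and $T_{Pl} \approx 1.2209 \times 10^{22}$ MeV, which are not quoted in the text; the function name is likewise illustrative.

```python
import math

T_PLANCK_MEV = 1.2209e22   # Planck energy in MeV (assumed value)
T_PLANCK_SEC = 5.391e-44   # Planck time in seconds (assumed value)

def cosmic_time_seconds(T_mev: float, kappa: float) -> float:
    """Cosmological time from (3.88), flat radiation-dominated universe."""
    prefactor = math.sqrt(3.0 / (32.0 * math.pi * kappa))
    return T_PLANCK_SEC * prefactor * (T_PLANCK_MEV / T_mev) ** 2

# Setting kappa = 1 and T = 1 MeV isolates the numerical coefficient.
coefficient = cosmic_time_seconds(1.0, 1.0)
# Time at T = 1 MeV with photons, e+ e- pairs and three neutrino species.
t_at_1_mev = cosmic_time_seconds(1.0, 3.537)
print(coefficient, t_at_1_mev)
```

With these inputs the coefficient comes out close to 1.39, and a temperature of 1 MeV corresponds to a time somewhat below one second.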

When the temperature decreases to a few MeV, that is, about a second after the big
bang, weak interactions become inefficient. These interactions are important in two 
respects. First, they keep neutrinos in thermal contact with each other and with the 
other particles, and second, they maintain the chemical equilibrium between protons 
and neutrons. The two events, namely, the thermal decoupling of neutrinos and the 
chemical decoupling of baryons, are somewhat separated in time. The first happens 
when the temperature is about 1.5 MeV, while the second occurs at T ~ 0.8 MeV. 
The chemical decoupling of the baryons is essential for nucleosynthesis and it will 
be considered in detail in the next section. Here we concentrate on the thermal 
decoupling of neutrinos. 

The main reactions responsible for the coupling of the electron neutrinos to the 
relativistic electron—positron plasma, and hence to radiation, are 


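$$e^- + e^+ \rightleftharpoons \nu_e + \bar\nu_e, \qquad e^\pm + \nu_e \to e^\pm + \nu_e, \qquad e^\pm + \bar\nu_e \to e^\pm + \bar\nu_e. \qquad (3.89)$$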


Some diagrams describing these interactions in electroweak theory (see next chapter)
are shown in Figure 3.4. Both charged $W^\pm$-bosons and the neutral $Z$-boson
contribute to these processes. At energies much smaller than the masses of the
intermediate bosons the propagators of the $Z$- and $W$-bosons reduce to $1/M_{W,Z}^2$ and
Fermi theory can be used to estimate the cross sections. For relativistic electrons
we have

$$\sigma_{e\nu} \simeq O(1)\,\frac{\alpha_w^2}{M_{W,Z}^4}\,(p_1 + p_2)^2, \qquad (3.90)$$

where $\alpha_w \simeq 1/29$ is the weak fine structure constant and $p_{1,2}$ are the 4-momenta of
the colliding particles. The neutrinos decouple from the electrons when the collision
time,

$$t_\nu \simeq (\sigma_{e\nu} n_e)^{-1} \simeq \alpha_w^{-2} M_W^4 T^{-5}, \qquad (3.91)$$

becomes of order the cosmological time $t$, which, in turn, is related to the temperature
via (3.87). When deriving (3.91) we have assumed that the electrons are
relativistic and hence $(p_1 + p_2)^2 \sim T^2$ and $n_e \sim T^3$. Comparing (3.91) to (3.87),
one finds that the electron neutrinos $\nu_e$ decouple at temperature

$$T_{\nu_e} \simeq O(1)\,\alpha_w^{-2/3} M_W^{4/3}. \qquad (3.92)$$

The exact calculation shows that the numerical coefficient in this formula is not
much different from unity and hence $T_{\nu_e} \simeq 1.5$ MeV.

At temperatures of order MeV, the number densities of $\mu$- and $\tau$-leptons are
negligibly small and the only reactions enforcing thermal contact between $\mu$- and
$\tau$-neutrinos and the rest of matter are the elastic scatterings of $\nu_{\mu,\tau}$ on electrons
($e\nu_{\mu,\tau} \to e\nu_{\mu,\tau}$); these are entirely due to $Z$-boson exchange. As a consequence,
the cross-sections for these reactions are smaller than the total cross-section of
the $e\nu_e$ interactions and the $\mu$- and $\tau$-neutrinos decouple earlier than the electron
neutrinos.

The most important conclusion from the above consideration is that all three
neutrino species thermally decouple before the electron-positron pairs begin to
annihilate at $T \sim m_e \simeq 0.5$ MeV. After decoupling, the neutrinos propagate without
further scatterings, preserving the Planckian spectrum. Their temperature decreases
in inverse proportion to the scale factor and is not influenced by the subsequent $e^\pm$
annihilation. The energy released in the electron-positron annihilation is thermalized
and as a result the radiation is “heated.” Therefore, the temperature of radiation
must be larger than the neutrino temperature. Let us calculate the radiation-to-neutrino
temperature ratio. After decoupling the neutrino entropy is conserved separately.
The total entropy of the other components, which is dominated by radiation
and the electron-positron plasma, is also conserved. Hence the ratio

$$\frac{s_\gamma + s_{e^\pm}}{s_\nu}$$

remains constant. Taking into account that $s_\nu \propto T_\nu^3$ and $s_\gamma \propto T_\gamma^3$, we have

$$\left(\frac{T_\gamma}{T_\nu}\right)^3\left(1 + \frac{s_{e^\pm}}{s_\gamma}\right) = C, \qquad (3.93)$$

where $C$ is a constant. Just after neutrino decoupling, but before $e^\pm$ annihilation,
$T_\gamma = T_\nu$ and $s_{e^\pm}/s_\gamma = 7/4$ (see (3.59)). Therefore, $C = 11/4$ and

$$\frac{T_\gamma}{T_\nu} = \left(\frac{11}{4}\right)^{1/3}\left(1 + \frac{s_{e^\pm}}{s_\gamma}\right)^{-1/3}. \qquad (3.94)$$

When the electron-positron pairs begin to annihilate at $T \simeq 0.5$ MeV the ratio of
entropies $s_{e^\pm}/s_\gamma$ decreases and finally becomes completely negligible (see (3.82),
where one has to substitute $m_e$ instead of $m_p$). Hence, after electron-positron
annihilation,

$$\frac{T_\gamma}{T_\nu} = \left(\frac{11}{4}\right)^{1/3} \simeq 1.4. \qquad (3.95)$$

Thus, the massless primordial neutrinos should have a temperature today of
$T_\nu \simeq 2.73\ \mathrm{K}/1.4 \simeq 1.95$ K. Unfortunately it is not easy, if even possible, to detect
the primordial neutrino background and verify this very robust prediction of the
standard cosmological model.
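
The entropy argument reduces to a one-line formula, which the following sketch evaluates at the two limiting moments. The function name and the variable names are illustrative; the photon temperature of 2.73 K is the value used in the text.

```python
def photon_to_neutrino_temperature_ratio(s_e_over_s_gamma: float = 0.0) -> float:
    """T_gamma / T_nu for a given entropy ratio s_e / s_gamma, see (3.94)."""
    return (11.0 / 4.0) ** (1.0 / 3.0) * (1.0 + s_e_over_s_gamma) ** (-1.0 / 3.0)

# Just after neutrino decoupling the pairs are still relativistic: s_e / s_gamma = 7/4.
ratio_before = photon_to_neutrino_temperature_ratio(7.0 / 4.0)
# Long after annihilation the pair entropy is negligible.
ratio_after = photon_to_neutrino_temperature_ratio(0.0)

T_CMB_KELVIN = 2.73
T_nu_today = T_CMB_KELVIN / ratio_after
print(ratio_before, ratio_after, T_nu_today)
```

The ratio starts at unity and ends at $(11/4)^{1/3}$, so the neutrino temperature today comes out just below 2 K, as stated above.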


Problem 3.13 Assuming that neutrinos have a small, but nonvanishing mass, esti- 
mate their temperature today. 


Problem 3.14 Calculate the contribution of neutrinos to the energy density 
after $e^\pm$ annihilation and determine at which redshift $z$ the total energy density
of radiation and relativistic neutrinos is exactly equal to the energy density of cold 
(nonrelativistic) matter. 


3.5 Nucleosynthesis 


The most widespread chemical element in the universe is hydrogen, constituting 
nearly 75% of all baryonic matter. Helium-4 constitutes about 25%. The other light 
elements and metals have only very small abundances. 

Simple arguments lead to the conclusion that the large amount of $^4$He could not
have been produced in stars. The binding energy of $^4$He is 28.3 MeV, and therefore,
when one nucleus of $^4$He is formed, the energy released per one baryon is about
7.1 MeV $\simeq 1.1 \times 10^{-5}$ erg. Assuming that one quarter of all baryons has been
fused into $^4$He in stars during the last 10 billion years ($3.2 \times 10^{17}$ s), we obtain the
following estimate for the luminosity-to-mass ratio:

$$\frac{L}{M_{bar}} \sim \frac{1}{4}\,\frac{1.1 \times 10^{-5}\ \mathrm{erg}}{(1.7 \times 10^{-24}\ \mathrm{g}) \times (3.2 \times 10^{17}\ \mathrm{s})} \simeq 5\ \frac{\mathrm{erg}}{\mathrm{g\,s}} \simeq 2.5\,\frac{L_\odot}{M_\odot},$$

where $M_\odot$ and $L_\odot$ are the solar mass and luminosity respectively. However, the
observed $L/M_{bar} < 0.05\,L_\odot/M_\odot$, and therefore, if the luminosity of baryonic matter
in the past was not much larger than at present, less than 0.5% of $^4$He can be fused
in stars.
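
The arithmetic behind this estimate can be retraced in a few lines. The sketch below uses the numbers quoted in the text; the MeV-to-erg conversion and the solar luminosity-to-mass ratio (about 1.9 erg g$^{-1}$ s$^{-1}$) are assumed values that the text does not state explicitly.

```python
ERG_PER_MEV = 1.602e-6               # 1 MeV in erg (assumed conversion)
BARYON_MASS_G = 1.7e-24              # baryon mass in grams, as in the text
TEN_GYR_S = 3.2e17                   # 10 billion years in seconds, as in the text
SOLAR_L_OVER_M = 3.83e33 / 1.99e33   # assumed solar L/M in erg g^-1 s^-1

# Energy released per baryon when one 4He nucleus (binding energy 28.3 MeV) forms.
energy_per_baryon_erg = 28.3 / 4.0 * ERG_PER_MEV

# One quarter of all baryons fused over 10 billion years.
luminosity_to_mass = 0.25 * energy_per_baryon_erg / (BARYON_MASS_G * TEN_GYR_S)
in_solar_units = luminosity_to_mass / SOLAR_L_OVER_M

# Observed L/M < 0.05 solar caps the helium mass fraction that stars can make.
max_stellar_helium_fraction = 0.25 * 0.05 / in_solar_units
print(luminosity_to_mass, in_solar_units, max_stellar_helium_fraction)
```

The result is about 5 erg g$^{-1}$ s$^{-1}$, a few times the solar value, and the implied stellar helium fraction stays below the half-percent level.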

The only plausible explanation of the helium abundance is that it was produced in 
the very early hot universe when the fusion energy constituted only a small fraction 
of the total energy. The energy released was then thermalized and redshifted long 
before the universe became transparent. It is obvious that a substantial amount of 
helium cannot be formed before the temperature drops below the binding energy 
~ 28 MeV. Indeed, primordial nucleosynthesis took place at temperature roughly 
0.1 MeV, that is, a few minutes after the big bang. The amount of helium produced 
depends on the availability of neutrons at this time, which, in turn, is determined by 
the weak interactions maintaining the chemical equilibrium between neutrons and 
protons. These weak interactions become inefficient when the temperature drops 
below a few MeV and, as a consequence, the neutron-to-proton ratio “freezes out.” 
Thus, the processes responsible for the chemical abundances of primordial elements 
began seconds after the big bang and continued for the next several minutes. 

In this section, we use analytical methods to calculate the abundances of the light 
primordial elements. Although more precise results are obtained with computer 
codes, the quasi-equilibrium approximation used here reproduces the numerical 
results with surprisingly good accuracy. In addition, the analytical methods allow 
us to understand why and how the primordial abundances depend on cosmological 
parameters. 


3.5.1 Freeze-out of neutrons 


We begin with the calculation of the neutron freeze-out concentration. The main 
processes responsible for the chemical equilibrium between protons and neutrons 
are the weak interaction reactions: 


$$n + \nu \rightleftharpoons p + e^-, \qquad n + e^+ \rightleftharpoons p + \bar\nu. \qquad (3.96)$$


Here $\nu$ always refers to the electron neutrino. To calculate the reaction rates, we can
use Fermi theory according to which the cross-sections can be expressed in terms
of the matrix element for the four-fermion interaction represented in Figure 3.5:

$$|\mathcal{M}|^2 = 16\,(1 + 3g_A^2)\,G_F^2\,(p_n \cdot p_\nu)(p_p \cdot p_e), \qquad (3.97)$$

where

$$G_F = \frac{\pi\alpha_w}{\sqrt{2}\,M_W^2} \simeq 1.17 \times 10^{-5}\ \mathrm{GeV}^{-2}$$

is the Fermi coupling constant and $(p_i \cdot p_j)$ are the scalar products of the 4-momenta
entering the vertex. The factor $g_A \simeq 1.26$ corrects the axial vector “weak charge”
of the nucleon by accounting for the possibility that gluons inside the nucleon split
into quark-antiquark pairs, thus contributing to the weak coupling. Note that the
Fermi constant can be determined to very high accuracy by measuring the lifetime
of the muon, while $g_A$ can be measured only in interactions involving nucleons.

For the process $a + b \to c + d$, the differential cross-section is

$$\frac{d\sigma_{ab}}{d\Omega} = \frac{1}{(8\pi)^2}\,\frac{|\mathcal{M}|^2}{(p_a + p_b)^2}\left[\frac{(p_c \cdot p_d)^2 - m_c^2 m_d^2}{(p_a \cdot p_b)^2 - m_a^2 m_b^2}\right]^{1/2}. \qquad (3.98)$$

This expression is manifestly Lorentz invariant and can be used in any coordinate
system. The 4-momenta of the outgoing particles $c$ and $d$ are related to the
4-momenta of the colliding particles $a$ and $b$ by the conservation law:
$p_c + p_d = p_a + p_b$.

Let us now consider the particular reaction

$$n + \nu \to p + e^-.$$

At temperatures of order a few MeV the nucleons are nonrelativistic and we have

$$(p_n + p_\nu)^2 \simeq m_p^2, \qquad (p_n \cdot p_\nu) = m_p\epsilon_\nu,$$

$$\left[(p_p \cdot p_e)^2 - m_p^2 m_e^2\right]^{1/2} = m_p\epsilon_e\sqrt{1 - (m_e/\epsilon_e)^2} = m_p\epsilon_e v_e, \qquad (3.99)$$

where $\epsilon_\nu$ is the energy of the incoming neutrino and $\epsilon_e \simeq \epsilon_\nu + Q$ is the energy of the
outgoing electron. The energy $Q \simeq 1.293$ MeV, introduced in (3.84), is released
when the neutron is converted into the proton. Expression (3.98) is valid only in
empty space. At temperatures above 0.5 MeV there are many electron-positron
pairs and the allowed final states for the electron are partially occupied. As a result,
the cross-section is reduced by the factor

$$1 - n_{\epsilon_e} = \left[1 + \exp(-\epsilon_e/T)\right]^{-1}$$

to account for the Pauli exclusion principle. Given this factor, the substitution of
(3.99) and (3.97) into (3.98) gives

$$\sigma_{n\nu} \simeq \frac{1 + 3g_A^2}{\pi}\,G_F^2\,\epsilon_e^2\, v_e\left[1 + \exp(-\epsilon_e/T)\right]^{-1}. \qquad (3.100)$$


Because the number density of the nucleons is negligible compared to the number
density of the light particles, the spectra of neutrinos and electrons are not
significantly influenced by the above reactions and always remain thermal. Hence, the
$n\nu$ interactions occurring within a time interval $\Delta t$ in a given comoving volume
containing $N_n$ neutrons reduce the total number of neutrons by

$$\Delta N_n = -\left(\sum_{\epsilon_\nu} \sigma_{n\nu}\, n_{\epsilon_\nu} v_\nu\, \Delta g_{\epsilon_\nu}\right) N_n\, \Delta t, \qquad (3.101)$$

where

$$n_{\epsilon_\nu} = \left[1 + \exp(\epsilon_\nu/T_\nu)\right]^{-1}$$

is the neutrino occupation number and $\Delta g_{\epsilon_\nu}$ is the phase volume element (see
(3.26), where $V = g = 1$). The velocity of neutrinos $v_\nu$ is equal to the speed of
light: $v_\nu = 1$.

It is useful to introduce the relative concentration of neutrons

$$X_n = \frac{N_n}{N_n + N_p} = \frac{n_n}{n_n + n_p}. \qquad (3.102)$$

Taking into account that the total number of baryons, $N_n + N_p$, is conserved, and
substituting (3.100) into (3.101), we find that the rate of change of $X_n$ due to the
$n\nu$ reaction is

$$\left(\frac{dX_n}{dt}\right)_{n\nu} = -\lambda_{n\nu} X_n = -\frac{1 + 3g_A^2}{2\pi^3}\,G_F^2\, Q^5 J(1; \infty)\, X_n, \qquad (3.103)$$

where

$$J(a; b) \equiv \int_a^b \sqrt{1 - \frac{(m_e/Q)^2}{q^2}}\;\frac{q^2 (q - 1)^2\, dq}{\left(1 + e^{\frac{Q}{T_\nu}(q - 1)}\right)\left(1 + e^{-\frac{Q}{T} q}\right)} \qquad (3.104)$$

and the integration variable is

$$q \equiv (\epsilon_\nu/Q) + 1 = \epsilon_e/Q.$$

Before electron-positron annihilation the temperatures of the electrons and neutrinos
are equal, that is, $T = T_\nu$. To estimate the integral in (3.104) we note that
$(m_e/Q)^2 \simeq 0.15$ and expand the square root in the integrand, keeping only the first
two terms. Furthermore, ignoring the Pauli exclusion principle for the electrons or,
equivalently, neglecting the second term in the denominator, we can calculate the
resulting integrals and obtain

$$J(1; \infty) \simeq \frac{45\zeta(5)}{2}\left(\frac{T_\nu}{Q}\right)^5 + \frac{7\pi^4}{60}\left(\frac{T_\nu}{Q}\right)^4 + \frac{3\zeta(3)}{2}\left(1 - \frac{1}{2}\frac{m_e^2}{Q^2}\right)\left(\frac{T_\nu}{Q}\right)^3. \qquad (3.105)$$

It is quite remarkable that this approximate expression reproduces the exact result
with very good accuracy at all relevant temperatures. For example, for $T_\nu/Q > 1$,
the error is about 2%, improving to 1% or better for $T_\nu/Q < 1$. Substituting (3.105)
together with the values of $G_F$ and $Q$ into (3.103), and converting from Planckian
to physical units, we find

$$\lambda_{n\nu} \simeq 1.63\left(\frac{T_\nu}{Q}\right)^3\left(\frac{T_\nu}{Q} + 0.25\right)^2\ \mathrm{s}^{-1}. \qquad (3.106)$$

Further simplifications made to obtain this last expression do not spoil the accuracy;
at the temperatures relevant for freeze-out, $T_\nu > 0.5$ MeV, the error remains less
than 2%.
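
To see how closely the compact fit (3.106) follows the expression it was derived from, one can evaluate (3.103) with $J$ taken from (3.105) and convert to s$^{-1}$. The sketch below does this; the electron mass (0.511 MeV), $\hbar$ (6.582 × 10$^{-22}$ MeV s) and the zeta-function values are assumed inputs not quoted in the text, and the function names are illustrative.

```python
import math

ZETA3, ZETA5 = 1.2020569, 1.0369278   # Riemann zeta(3), zeta(5)
G_F = 1.17e-11                        # Fermi constant in MeV^-2 (1.17e-5 GeV^-2)
G_A = 1.26                            # axial coupling quoted in the text
Q = 1.293                             # neutron-proton mass difference in MeV
M_E = 0.511                           # electron mass in MeV (assumed value)
HBAR_MEV_S = 6.582e-22                # hbar in MeV s (assumed value)

def J_approx(T_nu: float) -> float:
    """Approximate J(1; infinity) from (3.105)."""
    y = T_nu / Q
    return (45.0 * ZETA5 / 2.0 * y**5
            + 7.0 * math.pi**4 / 60.0 * y**4
            + 1.5 * ZETA3 * (1.0 - 0.5 * (M_E / Q) ** 2) * y**3)

def rate_from_3_103(T_nu: float) -> float:
    """lambda_n_nu in s^-1 from (3.103), with J taken from (3.105)."""
    prefactor = (1.0 + 3.0 * G_A**2) / (2.0 * math.pi**3) * G_F**2 * Q**5
    return prefactor * J_approx(T_nu) / HBAR_MEV_S

def rate_from_3_106(T_nu: float) -> float:
    """lambda_n_nu in s^-1 from the simplified fit (3.106)."""
    y = T_nu / Q
    return 1.63 * y**3 * (y + 0.25) ** 2

for T in (0.5, 1.0, 2.0):
    print(T, rate_from_3_103(T), rate_from_3_106(T))
```

For temperatures between 0.5 MeV and 2 MeV the two columns should agree at the percent level, in line with the accuracy claimed above.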


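Problem 3.15 Verify that the reaction rate for $n e^+ \to p\bar\nu$ is equal to

$$\lambda_{ne} = \frac{1 + 3g_A^2}{2\pi^3}\,G_F^2\, Q^5 J\left(-\infty; -\frac{m_e}{Q}\right), \qquad (3.107)$$

where $J$ is the integral defined in (3.104). Check that if $T_\nu = T$ and $T > m_e$, then
$\lambda_{ne} \approx \lambda_{n\nu}$. Consider the inverse reactions $p e^- \to n\nu$ and $p\bar\nu \to n e^+$. Show that
for $T_\nu = T$ their rates can be expressed through the rates of the direct reactions:

$$\lambda_{pe} = \exp(-Q/T)\,\lambda_{n\nu}, \qquad \lambda_{p\nu} = \exp(-Q/T)\,\lambda_{ne}. \qquad (3.108)$$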


Freeze-out The inverse reactions increase the neutron concentration at a rate
$\lambda_{p\to n} X_p$. The balance equation for $X_n$ is therefore

$$\frac{dX_n}{dt} = -\lambda_{n\to p} X_n + \lambda_{p\to n} X_p = -\lambda_{n\to p}\left(1 + e^{-Q/T}\right)\left(X_n - X_n^{eq}\right), \qquad (3.109)$$

where $\lambda_{n\to p} = \lambda_{ne} + \lambda_{n\nu}$ and $\lambda_{p\to n} = \lambda_{pe} + \lambda_{p\nu}$ are the total rates of the direct
and inverse reactions respectively, and

$$X_n^{eq} = \frac{1}{1 + \exp(Q/T)} \qquad (3.110)$$

is the equilibrium concentration of neutrons. To obtain the second equality in (3.109)
we used the relations in (3.108), assuming $T_\nu = T$, as well as the fact that the proton
concentration is $X_p = 1 - X_n$.

The exact solution of the linear differential equation (3.109), with the initial
condition $X_n \to X_n^{eq}$ as $t \to 0$, is

$$X_n(t) = X_n^{eq}(t) - \int_0^t \exp\left(-\int_{\tilde t}^{t} \lambda_{n\to p}(\bar t)\left(1 + e^{-Q/T}\right) d\bar t\right)\dot X_n^{eq}(\tilde t)\, d\tilde t, \qquad (3.111)$$

where the dot denotes the derivative with respect to time.

The second term on the right hand side in (3.111) characterizes the deviation
from equilibrium and is negligible compared to the first term at small $t$. Integrating
by parts, we can rewrite the solution (3.111) as an asymptotic series in increasing
powers of the derivatives of $X_n^{eq}$:

$$X_n = X_n^{eq}\left(1 - \frac{1}{\lambda_{n\to p}\,(1 + \exp(-Q/T))}\,\frac{\dot X_n^{eq}}{X_n^{eq}} + \cdots\right). \qquad (3.112)$$

If the reaction rate is much larger than the inverse cosmological time, that is,
$\lambda_{n\to p} \gg t^{-1} \sim -\dot X_n^{eq}/X_n^{eq}$, then we have $X_n \approx X_n^{eq}$ in agreement with result (3.84).
Subsequently, after the temperature has dropped significantly, $X_n^{eq} \to 0$, but the
second term on the right hand side in (3.111) approaches a finite limit. Instead
of vanishing, therefore, the neutron concentration freezes out at some finite value
$X_n^* = X_n(t \to \infty)$. The freeze-out effectively occurs when the second term on the
right hand side in (3.112) is of order the first one or, in other words, when the
deviation from equilibrium becomes significant. This happens before $e^\pm$ annihilation
and after the temperature drops below $Q \simeq 1.29$ MeV (as can be checked a
posteriori). Consequently we can set $\lambda_{n\to p} \simeq 2\lambda_{n\nu}$ and neglect $\exp(-Q/T)$ in the
equality $-\dot X_n^{eq}/X_n^{eq} \simeq \lambda_{n\to p}$, which determines the freeze-out temperature.
Substituting into this equality expression (3.110) for $X_n^{eq}$ and expression (3.106) for
$\lambda_{n\nu}$ and using the temperature-time relation (3.88), the equation for the freeze-out
temperature reduces to

$$\left(\frac{T_*}{Q}\right)^2\left(\frac{T_*}{Q} + 0.25\right)^2 \simeq 0.18\,\kappa^{1/2}. \qquad (3.113)$$


In the case of three neutrino species we have $\kappa \simeq 3.54$ and the freeze-out
temperature is $T_* \simeq 0.84$ MeV. The equilibrium neutron concentration at this time is
$X_n^{eq}(T_*) \simeq 0.18$. Of course, this number gives only a rough estimate for the
expected freeze-out concentration. One should not forget that at $T = T_*$ deviations
from equilibrium are very significant and, in fact, $X_n(T_*)$ exceeds the equilibrium
concentration by at least a factor of 2. Nevertheless, the above estimate enables
us to see how the freeze-out concentration depends on the number of relativistic
species present at the freeze-out time. Because $T_* \propto \kappa^{1/8}$, additional relativistic
components increase $T_*$ and, hence, more neutrons survive. Subsequently, nearly
all neutrons fuse with protons to form $^4$He and we anticipate, therefore, that
additional relativistic species increase the primordial helium abundance. For example,
in the extreme case of a very large number of unknown light particles, the
temperature $T_*$ would exceed $Q$ and the neutron concentration at freeze-out would be
almost 50%. This would lead to an unacceptably large abundance of $^4$He. Thus,
we see that primordial nucleosynthesis can help us to restrict the number of light
species.
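
Equation (3.113) is a quadratic in disguise: taking the square root of both sides gives $y(y + 0.25) = (0.18\,\kappa^{1/2})^{1/2}$ with $y = T_*/Q$. The sketch below solves it and evaluates (3.110) at the resulting temperature; the function names are illustrative.

```python
import math

Q = 1.293  # neutron-proton mass difference in MeV

def freeze_out_temperature(kappa: float) -> float:
    """Solve (3.113), y^2 (y + 0.25)^2 = 0.18 sqrt(kappa), for T_* = y Q in MeV."""
    rhs = math.sqrt(0.18 * math.sqrt(kappa))          # y (y + 0.25) = rhs
    y = (-0.25 + math.sqrt(0.0625 + 4.0 * rhs)) / 2.0  # positive root
    return y * Q

def equilibrium_neutron_fraction(T_mev: float) -> float:
    """Equilibrium neutron concentration (3.110)."""
    return 1.0 / (1.0 + math.exp(Q / T_mev))

T_star = freeze_out_temperature(3.537)
X_eq_star = equilibrium_neutron_fraction(T_star)
print(T_star, X_eq_star)
```

For $\kappa \simeq 3.54$ this should return a temperature near 0.84 MeV and an equilibrium concentration near 0.18. Increasing `kappa` raises both, which is the trend discussed above.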


Problem 3.16 Find the freeze-out temperature using the simple criterion $t \simeq 1/\lambda$
and verify that in this approximation one obtains the result quoted in many books
on cosmology, namely, $T_* \propto \kappa^{1/6}$. What accounts for the difference between this
and the above result, $T_* \propto \kappa^{1/8}$?


Now we turn to a more accurate estimate for the freeze-out concentration.
Since $X_n^{eq} \to 0$ as $T \to 0$, $X_n^*$ is given by the integral term in (3.111) when we
take the limit $t \to \infty$. The main contribution to the integral comes from $T > m_e$.
Therefore we set $\lambda_{n\to p} \simeq 2\lambda_{n\nu}$, where $\lambda_{n\nu}$ is given in (3.106). Using (3.88) to
replace the integration variable $t$ by $y = T/Q$, we obtain

$$X_n^* = \int_0^\infty \frac{\exp\left(-5.42\,\kappa^{-1/2}\int_0^y (x + 0.25)^2\left(1 + e^{-1/x}\right) dx\right)}{2y^2\,(1 + \cosh(1/y))}\, dy. \qquad (3.114)$$

For the case of three neutrino species ($\kappa \simeq 3.54$) one finds $X_n^* \simeq 0.158$. This result is
in very good agreement with more elaborate numerical calculations. The presence
of an additional light neutrino, accompanied by the corresponding antineutrino,
increases $\kappa$ by amount $2 \cdot \Delta\kappa_f \simeq 0.58$, and the freeze-out concentration becomes
$X_n^* \simeq 0.163$. Thus, two additional fermionic degrees of freedom increase $X_n^*$ by
about 0.5% and we conclude that

$$X_n^* \simeq 0.158 + 0.005\,(N_\nu - 3), \qquad (3.115)$$

where $N_\nu$ is the number of light neutrino species.
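
The double integral (3.114) can be evaluated with very modest numerical tools. The sketch below uses a cumulative trapezoidal rule on a uniform grid in $y$; the upper cutoff `y_max`, the number of steps and the function name are choices made here for illustration, not part of the text. The weight $1/[2y^2(1 + \cosh(1/y))]$ is rewritten as $e^{-1/y}/[y^2(1 + e^{-1/y})^2]$, which is algebraically identical but does not overflow at small $y$.

```python
import math

def frozen_neutron_fraction(kappa: float, y_max: float = 6.0, steps: int = 60000) -> float:
    """Evaluate (3.114) with a cumulative trapezoidal rule (illustrative sketch)."""
    c = 5.42 / math.sqrt(kappa)
    h = y_max / steps

    def inner(x: float) -> float:
        # Integrand of the exponent; exp(-1/x) tends to 0 as x tends to 0.
        return (x + 0.25) ** 2 * (1.0 + (math.exp(-1.0 / x) if x > 0.0 else 0.0))

    def weight(y: float) -> float:
        # 1 / (2 y^2 (1 + cosh(1/y))), written in an overflow-safe form.
        u = math.exp(-1.0 / y)
        return u / (y * y * (1.0 + u) ** 2)

    exponent = 0.0            # running value of the inner integral from 0 to y
    total = 0.0               # running value of the outer integral
    prev_inner = inner(0.0)
    prev_f = 0.0              # the outer integrand vanishes at y = 0
    for i in range(1, steps + 1):
        y = i * h
        cur_inner = inner(y)
        exponent += 0.5 * h * (prev_inner + cur_inner)
        f = math.exp(-c * exponent) * weight(y)
        total += 0.5 * h * (prev_f + f)
        prev_inner, prev_f = cur_inner, f
    return total

X_three = frozen_neutron_fraction(3.537)               # three neutrino species
X_four = frozen_neutron_fraction(3.537 + 2 * 0.288)    # one extra neutrino species
print(X_three, X_four)
```

The first value should land close to the 0.158 quoted above, and the second should be slightly larger, reflecting the shift described by (3.115).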


Neutron decay Until now we have neglected neutron decay,

$$n \to p + e^- + \bar\nu. \qquad (3.116)$$

This was justified because the lifetime of a free neutron $\tau_n \approx 886$ s is large compared
to the freeze-out time, $t_* \sim O(1)$ s. However, after freeze-out the interactions (3.96)
and the inverse three-body reaction (3.116) become inefficient and neutron decay
is the sole remaining cause for a change in the number of neutrons. As a result, the
neutron concentration decreases for $t > t_*$ as

$$X_n(t) = X_n^* \exp(-t/\tau_n). \qquad (3.117)$$

Note that after freeze-out one can neglect the degeneracy of the leptons, which
would increase the neutron lifetime, and use $\tau_n$ as quoted above. We will see that
nucleosynthesis, in which nearly all free neutrons are captured in the nuclei (where
they become stable), occurs at $t \sim 250$ s. This is a rather substantial fraction of
the neutron lifetime and hence the neutron decay significantly influences the final
abundances of the light elements.


3.5.2 “Deuterium bottleneck” 


Complex nuclei are formed as a result of nuclear interactions. Helium-4 could,
in principle, be built directly in the four-body collision: $p + p + n + n \to {}^4$He.
However, the low number densities during the period in question strongly suppress
these processes. Therefore, the light complex nuclei can be produced only through a
sequence of two-body reactions. The first step is deuterium (D) production through
the reaction

$$p + n \rightleftharpoons D + \gamma. \qquad (3.118)$$

There is no problem with this step because for $t < 10^3$ s the corresponding reaction
rate is much larger than the expansion rate.

Let us calculate the deuterium equilibrium abundance. We define the abundance
by weight:

$$X_D \equiv 2n_D/n_N,$$

where $n_N$ is the total number of nucleons (baryons) including those in complex
nuclei. The relation between $X_D$ and the abundances of the free neutrons,
$X_n \equiv n_n/n_N$, and protons, $X_p \equiv n_p/n_N$, can be found using (3.61) for each component.
Because the deuterium nucleus with spin zero is metastable, its total statistical
weight is $g_D = 3$. Taking into account that $g_p = g_n = 2$ and the chemical potentials
satisfy the condition $\mu_D = \mu_p + \mu_n$, we find

$$X_D = 5.67 \times 10^{-14}\,\eta_{10}\,T_{\rm MeV}^{3/2}\exp\left(\frac{B_D}{T}\right) X_p X_n, \qquad (3.119)$$

where

$$B_D \equiv m_p + m_n - m_D \simeq 2.23\ \mathrm{MeV}$$

is the binding energy of the deuterium. We have parameterized the baryon-to-photon
ratio by

$$\eta_{10} \equiv 10^{10} \times \frac{n_N}{n_\gamma}. \qquad (3.120)$$

This parameter is related to $\Omega_b$, the baryon contribution to the current critical
density, via

$$\Omega_b h_{75}^2 \simeq 6.53 \times 10^{-3}\,\eta_{10}. \qquad (3.121)$$


At temperatures of order $B_D$, the abundance $X_D$ is still extremely small and
even at $T \sim 0.5$ MeV, for example, it is only about $2 \times 10^{-13}$. One of the reasons
for this is the large number of energetic photons with $\epsilon > B_D$, which destroy the
deuterium. The number of such photons per deuterium nucleus is

$$\frac{n_\gamma(\epsilon > B_D)}{n_D} \sim \frac{B_D^2\, T\, e^{-B_D/T}}{n_N X_D} \sim \frac{10^{10}}{\eta_{10} X_D}\left(\frac{B_D}{T}\right)^2 e^{-B_D/T}, \qquad (3.122)$$

which becomes less than unity only at $T < 0.06$ MeV. Therefore, we expect that
deuterium can constitute a significant fraction of baryonic matter only if the
temperature is about 0.06 MeV. In fact, according to (3.119), for $\eta_{10} \sim O(1)$ the
equilibrium deuterium abundance changes abruptly from $10^{-5}$ to of order unity as the
temperature drops from 0.09 MeV to 0.06 MeV.
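
The steepness of this change is easiest to appreciate by tabulating (3.119). The sketch below holds $X_n \simeq 0.16$ and $X_p \simeq 0.84$ fixed, which is an assumption made here for illustration (the free-nucleon abundances in fact change once deuterium becomes abundant), and sets $\eta_{10} = 1$; the function name is illustrative.

```python
import math

B_D = 2.23  # deuterium binding energy in MeV

def deuterium_equilibrium_abundance(T_mev: float, eta10: float = 1.0,
                                    X_n: float = 0.16, X_p: float = 0.84) -> float:
    """Equilibrium X_D from (3.119); X_n and X_p are held fixed as an assumption."""
    return 5.67e-14 * eta10 * T_mev**1.5 * math.exp(B_D / T_mev) * X_p * X_n

for T in (0.5, 0.09, 0.08, 0.07, 0.06):
    print(f"T = {T:4.2f} MeV   X_D ~ {deuterium_equilibrium_abundance(T):.1e}")
```

The table should show an abundance of order $10^{-13}$ at 0.5 MeV, of order $10^{-5}$ at 0.09 MeV, and of order unity at 0.06 MeV, where the fixed-$X_n$, $X_p$ assumption is no longer adequate.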

The rates of reactions converting deuterium into heavier elements are proportional
to the deuterium concentration and these reactions are strongly suppressed
until $X_D$ has grown to a substantial value. This delays the formation of the other
light elements, including $^4$He. In fact, because of the large binding energy of $^4$He
(28.3 MeV), the equilibrium helium abundance would already be of order unity at
temperature 0.3 MeV. However, this does not happen and the helium abundance
is still negligible at $T \simeq 0.3$ MeV because the rate of the deuterium reactions,
responsible for maintaining helium in chemical equilibrium with the nucleons, is
much smaller than the expansion rate at this time. As a result, the heavier elements
are chemically decoupled and present in completely negligible amounts despite
their large binding energies. Only protons, neutrons and deuterium are in chemical
equilibrium with each other. This situation is usually referred to as the “deuterium
bottleneck.”


Problem 3.17 Derive the formula for the equilibrium concentration of $^4$He and
verify that it is of order unity at $T \simeq 0.3$ MeV.


Let us determine when the deuterium bottleneck opens up. This occurs when the
main reactions converting deuterium into heavier elements,

$$(1)\ D + D \to {}^3\mathrm{He} + n, \qquad (2)\ D + D \to T + p, \qquad (3.123)$$

become efficient. Within the relevant temperature interval, 0.06 MeV to 0.09 MeV,
the experimentally measured rates of these reactions are

$$\langle\sigma v\rangle_{DD1} = (1.3\text{–}2.2) \times 10^{-17}\ \mathrm{cm^3\,s^{-1}},$$

$$\langle\sigma v\rangle_{DD2} = (1.2\text{–}2) \times 10^{-17}\ \mathrm{cm^3\,s^{-1}}, \qquad (3.124)$$


respectively. Due to reactions (3.123), the number of deuterium nuclei in a comoving
volume containing $N_D$ nuclei decreases during a time interval $\Delta t$ by

$$\Delta N_D = -\langle\sigma v\rangle_{DD}\, n_D N_D\, \Delta t. \qquad (3.125)$$

Rewriting this equation in terms of the concentration by weight, $X_D = 2N_D/N_N$,
we obtain

$$\Delta X_D = -\tfrac{1}{2}\lambda_{DD} X_D^2\, \Delta t, \qquad (3.126)$$

where

$$\lambda_{DD} = \left(\langle\sigma v\rangle_{DD1} + \langle\sigma v\rangle_{DD2}\right) n_N \simeq 1.3 \times 10^5\, K(T)\, T_{\rm MeV}^3\, \eta_{10}\ \mathrm{s}^{-1}. \qquad (3.127)$$

The function $K(T)$ characterizes the temperature dependence of the reaction rate
and it changes from 1 to 0.6 as the temperature drops from 0.09 MeV to 0.06 MeV.
A substantial amount of the available deuterium is converted into helium-3 and
tritium within a cosmological time $t$ only if

$$|\Delta X_D| \simeq \tfrac{1}{2}\lambda_{DD} X_D^2\, t \simeq X_D. \qquad (3.128)$$

It follows that the deuterium bottleneck opens up when

$$X_D^{(bn)} \simeq \frac{2}{\lambda_{DD}\, t} \simeq \frac{1.2 \times 10^{-5}}{\eta_{10}\, T_{\rm MeV}\, K(T)}, \qquad (3.129)$$





where we have used the time-temperature relation (3.88) with $\kappa \simeq 1.11$. From
(3.119), one can express the temperature as a function of $X_D$:

$$T_{\rm MeV}(X_D) \simeq \frac{0.061}{1 + 2.7 \times 10^{-2}\,\ln(X_D/\eta_{10})}. \qquad (3.130)$$

Substituting this expression into (3.129) and solving the resulting equation for $X_D^{(bn)}$
by the method of iteration for $10 > \eta_{10} > 10^{-1}$, we find

$$X_D^{(bn)} \simeq 1.5 \times 10^{-4}\,\eta_{10}^{-1}\left(1 - 7 \times 10^{-2}\ln\eta_{10}\right). \qquad (3.131)$$

Problem 3.18 Verify that after electron-positron annihilation, the value of $\kappa$ in
(3.85) becomes

$$\kappa \simeq 1.11 + 0.15\,(N_\nu - 3), \qquad (3.132)$$

where $N_\nu$ is the number of neutrino species. (Hint Recall that the neutrino and
radiation temperatures are different after $e^\pm$ annihilation.)
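
The iteration that leads from (3.129) and (3.130) to (3.131) can be sketched as a simple fixed-point loop. The text specifies $K(T)$ only through its end values (1 at 0.09 MeV, 0.6 at 0.06 MeV); the linear interpolation used below is an assumption made here, so the output should be expected to match (3.131) in order of magnitude and trend rather than digit for digit. All function names are illustrative.

```python
import math

def K(T_mev: float) -> float:
    """Assumed linear interpolation of K(T): 0.6 at 0.06 MeV, 1 at 0.09 MeV."""
    return min(1.0, max(0.6, 0.6 + 0.4 * (T_mev - 0.06) / 0.03))

def temperature_from_abundance(X_D: float, eta10: float) -> float:
    """Temperature in MeV as a function of X_D, relation (3.130)."""
    return 0.061 / (1.0 + 2.7e-2 * math.log(X_D / eta10))

def bottleneck_abundance(eta10: float, iterations: int = 50) -> float:
    """Fixed-point iteration of (3.129) combined with (3.130)."""
    X = 1.0e-4 / eta10  # starting guess
    for _ in range(iterations):
        T = temperature_from_abundance(X, eta10)
        X = 1.2e-5 / (eta10 * T * K(T))
    return X

for eta10 in (0.1, 1.0, 10.0):
    print(eta10, bottleneck_abundance(eta10))
```

For $\eta_{10} = 1$ the iteration settles at an abundance of order $10^{-4}$, and the result scales roughly as $\eta_{10}^{-1}$, as (3.131) indicates.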





After the deuterium concentration reaches $X_D^{(bn)}$, everything proceeds very
quickly. According to (3.130) the equilibrium concentration $X_D$ increases from
$10^{-4}$ to $10^{-2}$ as the temperature drops from 0.08 MeV to 0.07 MeV. As a result, the
rate of deuterium conversion into heavier elements, proportional to $X_D$, becomes
100 times larger than the expansion rate. Such a system is far from equilibrium and
nucleosynthesis is described by a complicated system of kinetic equations which
are usually solved numerically. In Figures 3.7 and 3.8 below we present the results
of highly precise numerical calculations for the time evolution of the light element
concentrations and for their final abundances, respectively. We will now show how
these results can be reproduced analytically with good accuracy. The system of kinetic
equations will be solved using the quasi-equilibrium approximation. This will
provide us with a solid physical understanding of primordial nucleosynthesis and
will reveal the reasons for the dependence of final abundances on the cosmological
parameters. To simplify our task we consider only the most abundant isotopes up
to $^7$Be, among which are $^4$He, D, $^3$He, T, lithium-7 ($^7$Li) and beryllium ($^7$Be) itself.
Other elements such as $^6$Li, $^8$B etc. are produced in much smaller amounts and will
be ignored.

The most important nuclear reactions are shown schematically in Figure 3.6. 
The reader is encouraged to keep a copy of this figure at hand throughout the 
rest of this section. Every element corresponds to a “reservoir.” The reservoirs are 
connected by “one-way pipes”, one for each nuclear reaction converting an element 
into another. To simplify the diagram, we include only the initial elements involved 
in the reaction; the outcome can easily be inferred from the diagram. The efficiency 
of the pipe is determined by the reaction rate. For example, for the rate of escape
from the reservoir $A$ due to the reaction $AB \to CD$, we find

$$\dot X_A = -A_B^{-1}\lambda_{AB} X_A X_B, \qquad (3.133)$$

and the rate of increase of the element $C$ is

$$\dot X_C = A_C A_A^{-1} A_B^{-1}\lambda_{AB} X_A X_B. \qquad (3.134)$$

Here $X_A \equiv A_A n_A/n_N$, etc. are the concentrations by weight of the corresponding
elements, $A$ are their mass numbers (for example, $A_D = 2$ and $A_T = 3$, etc.) and
$\lambda_{AB} = \langle\sigma v\rangle_{AB}\, n_N$. The reaction is efficient only if $|\dot X_A/X_A| > t^{-1}$.

[Fig. 3.6: diagram of the main nuclear reactions; image not reproduced.]


The general picture is as follows. Until the temperature drops to 0.08 MeV, the 
p,n and D reservoirs are in equilibrium with each other and decoupled from the 
rest (the deuterium bottleneck). However, as soon as the temperature drops to 0.08 
MeV, the DD pipes become very efficient, rapidly converting the deuterium supply 
from the np reservoir into heavier elements. Finally, nearly all free neutrons have 
been bound in nuclei. Around this time the concentrations of the elements in the 
various “reservoirs” freeze out at their final abundances. Now we consider the 
build-up of each element in detail. 


3.5.3 Helium-4 


Once deuterium reaches the abundance $X_D^{(bn)}$, the bottleneck opens and
nucleosynthesis begins. However, at the beginning, deuterium production in the reaction
pn— Dy is still greater than its destruction in DD reactions. The ratio of the 
corresponding rates is 





$$\frac{\lambda_{pn} X_p X_n}{\lambda_{DD} X_D^2} \simeq 10^{4}\left(\frac{10^{-4}}{X_D}\right)^2, \qquad (3.135)$$


where the experimental value for $\lambda_{pn}/\lambda_{DD}$ is about $10^{-3}$ at $T_{\rm MeV} \approx 0.07$–$0.08$ and
we have set $X_n \approx 0.16$, $X_p \approx 0.84$. Because of the very high supply rate,
deuterium remains in chemical equilibrium with nucleons until its abundance rises to
$X_D \approx 10^{-2}$. After that, two-body DD reactions become dominant and $X_D$ begins to
decrease (see Figure 3.7, where the time dependence of abundances for $\eta_{10} \simeq 7$
is shown). (Note that the deuterium photodestruction can be ignored now because it
alone cannot prevent $X_D$ from further growth.) Although the deuterium concentration
ceases to grow, the concentration of free neutrons strongly decreases because
they go first to the deuterium reservoir and then, without further delay, proceed
down the pipes towards the heavier elements. For most of the neutrons, the final
destination is the $^4$He reservoir. In fact, the binding energy of $^4$He (28.3 MeV)
is four times larger than the binding energies of the intermediate elements, $^3$He
(7.72 MeV) and T (6.92 MeV) and, therefore, if $^4$He were in equilibrium with these
elements, it would dominate at low temperatures. A system always tends to
equilibrium in the quickest possible way. Therefore, most of the free neutrons will form
$^4$He to fulfil its largest equilibrium demand.

[Fig. 3.7: time dependence of the light-element abundances for $\eta_{10} \simeq 7$; image not reproduced.]


Problem 3.19 Verify that at $T \simeq 0.1$ MeV the equilibrium concentrations of D,
$^3$He and T are many orders of magnitude smaller than the $^4$He concentration.


The reactions in which $^4$He is formed proceed as follows. First, deuterium is
converted into tritium and $^3$He according to (3.123). Next, tritium combines with
deuterium to produce $^4$He:

$$T\,D \to {}^4\mathrm{He}\; n. \qquad (3.136)$$

In this sequence, two of the three neutrons end up in the newly formed $^4$He nucleus
and one neutron returns to the $np$ reservoir. The $^3$He nucleus can interact either
with a free neutron and proceed to the T reservoir,

$${}^3\mathrm{He}\; n \to T\, p, \qquad (3.137)$$

or with deuterium and go directly to the $^4$He reservoir,

$${}^3\mathrm{He}\; D \to {}^4\mathrm{He}\; p. \qquad (3.138)$$

The ratio of the rates for these reactions is
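$$\frac{\lambda_{^3\mathrm{He}\,n}\, X_{^3\mathrm{He}}\, X_n}{\lambda_{^3\mathrm{He}\,D}\, X_{^3\mathrm{He}}\, X_D} \approx 6\,\frac{X_n}{X_D}. \qquad (3.139)$$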


Hence, until the concentration of free neutrons, $X_n$, drops below $X_D$ (which never
exceeds $10^{-2}$), (3.137) is more efficient than (3.138). Therefore, most of the neutrons
are fused into $^4$He through the reaction chains $np \to D \to T \to {}^4$He and
$np \to D \to {}^3\mathrm{He} \to T \to {}^4$He. Within a short interval around the time when the
deuterium concentration reaches its maximal value $X_D \approx 10^{-2}$, nearly all neutrons,
except a very small fraction $\sim 10^{-4}$, end up in $^4$He nuclei. Therefore, the final $^4$He
abundance is completely determined by the available free neutrons at this time.
According to (3.130), $X_D$ is of order $10^{-2}$ at temperature


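$$T_{\rm MeV}^{(N)} \simeq 0.07\,(1 + 0.03\ln\eta_{10}), \qquad (3.140)$$

or, equivalently, at time

$$t_{\rm sec}^{(N)} \simeq 269\left(1 - 0.07\,(N_\nu - 3) - 0.06\ln\eta_{10}\right), \qquad (3.141)$$

where we have used (3.132) for $\kappa$ in (3.88). Because half of the total weight of $^4$He
is due to protons, its final abundance by weight is

$$X_{^4\mathrm{He}}^f = 2X_n\left(t^{(N)}\right) = 2X_n^*\exp\left(-\frac{t^{(N)}}{\tau_n}\right). \qquad (3.142)$$

Substituting here $X_n^*$ from (3.115) and $t^{(N)}$ from (3.141), one finally obtains

$$X_{^4\mathrm{He}}^f \simeq 0.23 + 0.012\,(N_\nu - 3) + 0.005\ln\eta_{10}. \qquad (3.143)$$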


This result is in good agreement with the numerical calculations shown in Figure 3.8.
The $^4$He abundance depends on the number of ultra-relativistic species $N_\nu$ and
the baryon density characterized by $\eta_{10}$. The presence of an additional massless
neutrino increases the final abundance by about 1.2%. This increase comes from two
sources which give comparable contributions. First, the greater the number of
ultra-relativistic species, the faster the universe expands for a given temperature. This
means neutrons freeze out earlier, leading to larger $X_n^{*}$. Second, if there are more
light species, the nucleosynthesis temperature is reached sooner and more neutrons
avoid decay. Thus, given $\eta_{10}$, we can put rather strong bounds on the number of
unknown light species using the observational data on helium-4 abundance. We will
see later that $\eta_{10}$ can be determined with high precision from data on deuterium
abundance and CMB fluctuations.

It follows from (3.141) that nucleosynthesis begins earlier in a more dense universe
and hence more neutrons are available. Therefore, the final helium-4 abundance
depends logarithmically on the baryon density and, according to (3.143),
increases by 1% or so if the baryon density is 10 times larger.
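
As a quick numerical illustration of (3.140), (3.141) and (3.143), the fitting formulae can be evaluated directly. The sketch below is not part of the original derivation: the function names are illustrative, and the coefficients are simply the ones quoted in the text.

```python
import math

def nucleosynthesis_temperature_mev(eta10):
    """Temperature (MeV) at which X_D reaches ~1e-2, eq. (3.140)."""
    return 0.07 * (1.0 + 0.03 * math.log(eta10))

def nucleosynthesis_time_sec(eta10, n_nu=3.0):
    """Corresponding cosmological time in seconds, eq. (3.141)."""
    return 269.0 * (1.0 - 0.07 * (n_nu - 3.0) - 0.06 * math.log(eta10))

def helium4_abundance(eta10, n_nu=3.0):
    """Final helium-4 abundance by weight, eq. (3.143)."""
    return 0.23 + 0.012 * (n_nu - 3.0) + 0.005 * math.log(eta10)

if __name__ == "__main__":
    for eta10 in (1.0, 5.0, 10.0):
        print(eta10,
              nucleosynthesis_temperature_mev(eta10),
              nucleosynthesis_time_sec(eta10),
              helium4_abundance(eta10))
```

Comparing `helium4_abundance(1.0, 4.0)` with `helium4_abundance(1.0, 3.0)` reproduces the shift of 0.012 per additional light species, and a tenfold increase in `eta10` changes the result by `0.005 * ln(10)`, about 0.01.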


3.5.4 Deuterium 


To calculate the time evolution and freeze-out abundance of deuterium, we make 
a series of assumptions which drastically simplify our task. The validity of these 
assumptions can be checked a posteriori. 

First, we ignore $^7$Be and $^7$Li because their abundances turn out to be small
compared to the abundances of $^3$He and T. Second, we assume that the $^3$He and
T abundances take on their quasi-equilibrium values, that is, they are completely
determined by the condition that “the total flux coming into each corresponding
reservoir must be equal to the outgoing flux” (see Figure 3.6). Concretely, in the
case of $^3$He, the amount of $^3$He produced within a given time interval via DD and
Dp reactions should be equal to the amount of $^3$He destroyed within the same time
in $^3$HeD and $^3$He$n$ reactions.

Let us describe the primordial nucleosynthesis process once more, but this time
in greater detail. When the deuterium concentration reaches $X_D \simeq 10^{-2}$ the DD
reactions become efficient and the deuterium produced in the pn reaction is quickly
converted into $^3$He and T. Thus, further deuterium accumulation stops and, in fact,
its concentration begins to decrease. As a result, neutrons are taken from the np
reservoir and sent, without delay in the D reservoir, directly to the $^3$He and T
reservoirs along the DD and Dp pipes. From there they proceed through $^3$HeD and
TD pipes to their final destination, the $^4$He reservoir.

Not all the neutrons reach the $^4$He reservoir on their first attempt; some of them
“leak out” on the way there. Concretely, neutrons are released in the reactions
$\mathrm{DD} \to {}^3\mathrm{He}\,n$ and $\mathrm{TD} \to {}^4\mathrm{He}\,n$ and they return
to the np reservoir. From there, they again try to reach the $^4$He reservoir. Thus, after
the beginning of nucleosynthesis, there is a steady flux of neutrons from the np reservoir
to the $^4$He reservoir through the intermediate D, $^3$He and T reservoirs. The system of
pipes is self-regulating and maintains the $^3$He and T concentrations in accordance with
the demands of quasi-equilibrium. To be precise, the rate of destruction of $^3$He and T is
proportional to their concentrations and if, for example, the abundance of $^3$He becomes
larger or smaller than the quasi-equilibrium concentration, then the size of the $^3$He$n$
pipe grows or shrinks respectively, and the concentration quickly returns to its
quasi-equilibrium value.

If the universe were not expanding, then nearly all free neutrons would end up
in $^4$He nuclei and there would be negligible abundances of the other light elements.
However, in an expanding universe, expansion acts as a “shut-off valve” for the
pipes. At the moment the expansion rate becomes larger than a particular reaction
rate, the corresponding pipe closes. When all pipes entering a reservoir have closed,
the abundance of that light element freezes out. The final abundances of $^3$He and
T are determined by the freeze-out concentration of deuterium, which we now
calculate.

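Let us derive the system of kinetic equations for the abundances by weight $X_n$,
$X_D$, $X_T$ and $X_{^3\mathrm{He}}$. The concentration of free neutrons decreases due to
the reactions $pn \to \mathrm{D}\gamma$ and ${}^3\mathrm{He}\,n \to \mathrm{T}p$ but increases
in the processes $\mathrm{DD} \to {}^3\mathrm{He}\,n$ and $\mathrm{DT} \to {}^4\mathrm{He}\,n$.
Therefore, taking into account (3.133) and (3.134), we obtain

$$ \frac{dX_n}{dt} = -\lambda_{pn} X_p X_n - \frac{1}{3}\lambda_{^3\mathrm{He}\,n} X_{^3\mathrm{He}} X_n + \frac{1}{4}\lambda_{DD1} X_D^2 + \frac{1}{6}\lambda_{DT} X_D X_T. \tag{3.144} $$

Deuterium is produced only in the reaction $pn \to \mathrm{D}\gamma$ and destroyed in the
reactions $\mathrm{DD} \to {}^3\mathrm{He}\,n$, $\mathrm{DD} \to \mathrm{T}p$,
$\mathrm{D}p \to {}^3\mathrm{He}\,\gamma$, ${}^3\mathrm{He}\,\mathrm{D} \to {}^4\mathrm{He}\,p$,
$\mathrm{DT} \to {}^4\mathrm{He}\,n$. Hence,

$$ \frac{dX_D}{dt} = 2\lambda_{pn} X_p X_n - \frac{1}{2}\lambda_{DD} X_D^2 - \lambda_{Dp} X_D X_p - \frac{1}{3}\lambda_{DT} X_D X_T - \frac{1}{3}\lambda_{^3\mathrm{He}\,D} X_{^3\mathrm{He}} X_D, \tag{3.145} $$

where $\lambda_{DD} = \lambda_{DD1} + \lambda_{DD2}$. The equation for tritium is obtained similarly:

$$ \frac{dX_T}{dt} = \frac{3}{4}\lambda_{DD2} X_D^2 + \lambda_{^3\mathrm{He}\,n} X_{^3\mathrm{He}} X_n - \frac{1}{2}\lambda_{DT} X_D X_T. \tag{3.146} $$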


We assume that the tritium concentration satisfies the quasi-equilibrium condition,
that is, the rate of its overall change is much smaller than the rates of the individual
reactions on the right hand side of (3.146). Therefore, we set $dX_T/dt \approx 0$ and
(3.146) reduces to

$$ \frac{3}{4}\lambda_{DD2} X_D^2 + \lambda_{^3\mathrm{He}\,n} X_{^3\mathrm{He}} X_n \simeq \frac{1}{2}\lambda_{DT} X_D X_T. \tag{3.147} $$

The quasi-equilibrium condition for helium-3 takes the form

$$ \frac{3}{4}\lambda_{DD1} X_D^2 + \frac{3}{2}\lambda_{Dp} X_D X_p \simeq \frac{1}{2}\lambda_{^3\mathrm{He}\,D} X_{^3\mathrm{He}} X_D + \lambda_{^3\mathrm{He}\,n} X_{^3\mathrm{He}} X_n. \tag{3.148} $$

Using (3.147) and (3.148) to express $X_{^3\mathrm{He}}$ and $X_T$ through the neutron and
deuterium concentrations, (3.144) and (3.145) become

$$ \frac{dX_n}{dt} = \frac{1}{4}\lambda_{DD} X_D^2 - \lambda_{pn} X_p X_n, \tag{3.149} $$

$$ \frac{dX_D}{dt} = 2\lambda_{pn} X_p X_n - \lambda_{DD} X_D^2 - 2\lambda_{Dp} X_D X_p. \tag{3.150} $$

It is convenient to rewrite these equations using a temperature variable instead of a
time variable (see (3.88)). Substituting the explicit value for $\lambda_{DD}$ from (3.127) then
gives

$$ \frac{dX_n}{dT_{\mathrm{MeV}}} = \alpha\eta_{10}\left(R_1 X_n - X_D^2\right), \tag{3.151} $$

$$ \frac{dX_D}{dT_{\mathrm{MeV}}} = 4\alpha\eta_{10}\left(X_D^2 + R_2 X_D - \frac{1}{2}R_1 X_n\right), \tag{3.152} $$

where

$$ \alpha \equiv \alpha(T) \simeq 0.86 \times 10^{5}\, K(T) $$

and the coefficient $K(T)$ describes the temperature dependence of $\langle\sigma v\rangle_{DD}$.
Its value changes from 1 to 0.5 when the temperature drops from 0.09 MeV to 0.04 MeV.
Over the same temperature interval the coefficients $R_1$ and $R_2$ are

$$ R_1 \equiv 4X_p\frac{\lambda_{pn}}{\lambda_{DD}} \simeq (3\text{--}8) \times 10^{-3}, \qquad R_2 \equiv 2X_p\frac{\lambda_{Dp}}{\lambda_{DD}} \simeq (2.5\text{--}2.3) \times 10^{-5}, \tag{3.153} $$

where the experimental value for the ratio of the corresponding reaction rates has
been used. The system of equations (3.151) and (3.152) has attractor solutions.

First we consider the initial stage of nucleosynthesis when $X_D \ll X_n$. It turns
out that in this case the deuterium concentration satisfies the quasi-equilibrium
condition and we can set $dX_D/dT \approx 0$ in (3.152). Since $R_2 \ll R_1$, the term
$R_2 X_D$ is small compared to $R_1 X_n$, and it follows from (3.152) and (3.151) that

$$ X_D = \sqrt{\frac{R_1}{2} X_n}\,\left(1 + O\!\left(\frac{X_D}{X_n}\right)\right). \tag{3.154} $$

This solution is valid after the deuterium concentration reaches its maximal value
of order $10^{-2}$ and begins to decrease (Figure 3.7). It fails as soon as $X_n$ drops to
$X_D$ and, at this time, $X_n \sim X_D \sim R_1$. Note that, according to (3.154), the maximal
concentration of deuterium is equal to $X_D \simeq 10^{-2}$ for $X_n \simeq 0.12$. This is in
agreement with the naive estimate derived earlier by comparing the rates of pn and DD
reactions. Substituting (3.154) into (3.151), we obtain

$$ \frac{dX_n}{dT_{\mathrm{MeV}}} \simeq \frac{1}{2}\alpha\eta_{10} R_1 X_n. \tag{3.155} $$

In this regime the neutrons determine their own fate and also dictate the
quasi-equilibrium concentrations to the other elements, including deuterium. In other
words, they regulate the shut-off valves between the reservoirs in Figure 3.6. At
the beginning of nucleosynthesis, at $T = T^{(N)}$, most of the neutrons are still free,
and hence $X_n \simeq 0.12$. Neglecting the temperature dependence of $\alpha$, we then find
the approximate solution of (3.155):

$$ X_n(T) \simeq 0.12 \exp\left(\frac{1}{2}\alpha\eta_{10} R_1 \left(T - T^{(N)}\right)\right), \tag{3.156} $$

where $T^{(N)}$ is given in (3.140). It follows that the neutron concentration becomes
comparable to the deuterium concentration, $X_n \sim X_D \sim R_1$, at the temperature

$$ T^{*}_{\mathrm{MeV}} \simeq 0.07 + 0.002 \ln \eta_{10} - 0.02\, K^{-1}\eta_{10}^{-1}. \tag{3.157} $$

In a universe with very low baryon density, $K\eta_{10} < 0.3$, the abundance of free
neutrons (neglecting their decay) does not decrease below $X_D$ and freezes out at
the value

$$ X_n^f \simeq 0.12 \exp\left(-\frac{1}{2}\alpha\eta_{10} R_1 T^{(N)}_{\mathrm{MeV}}\right) \simeq 0.12 \exp(-10 K \eta_{10}). \tag{3.158} $$

The remaining free neutrons then decay. This explains why, for instance, the $^4$He
abundance is less than 1% in a universe with $\eta_{10} \simeq 10^{-2}$ (Figure 3.8).


Problem 3.20 At which value does the deuterium concentration freeze out in a low
baryon density universe? How does it depend on $\eta_{10}$?


In the derivation of the $^4$He abundance presented above, we tacitly assumed that
the reactions converting the neutrons into $^4$He are very efficient in transferring most
of the available neutrons into heavier elements. This means that (3.143) is valid
only for $K\eta_{10} > 0.3$. Observations suggest that $10 > \eta_{10} > 1$ and therefore we will
assume below that $\eta_{10} > 1$.

When the neutron concentration becomes of order the deuterium concentration,
(3.154) fails and the system quickly reaches another attractor. Afterwards, the
neutron concentration satisfies the quasi-equilibrium condition, $dX_n/dT \approx 0$, and
it follows from (3.151) that

$$ X_n = \frac{1}{R_1} X_D^2 \left(1 + O\!\left(\frac{X_n}{X_D}\right)\right). \tag{3.159} $$

Equation (3.152) then becomes

$$ \frac{dX_D}{dT_{\mathrm{MeV}}} = 2\alpha\eta_{10}\left(X_D^2 + 2R_2 X_D\right). \tag{3.160} $$


Now the deuterium determines its own fate and regulates the quasi-equilibrium
concentrations of the neutrons and the other light elements. Since $R_2$ changes
insignificantly within the relevant temperature interval (see (3.153)), it can be taken
to be a constant. Equation (3.160) is then readily integrated:

$$ \left(1 + \frac{2R_2}{X_D(T)}\right) = \left(1 + \frac{2R_2}{X_D(T^{*})}\right) \exp\left(4R_2\eta_{10}\int_T^{T^{*}} \alpha(T)\, dT\right), \tag{3.161} $$

where the temperature is expressed in MeV. As the temperature decreases, the
deuterium concentration freezes out at $X_D^f \equiv X_D(T \to 0)$. Taking into account
that $X_D(T^{*}) \sim R_1 \gg R_2$, we obtain

$$ X_D^f \simeq \frac{2R_2}{\exp(A\eta_{10}) - 1}, \tag{3.162} $$

where

$$ A \equiv 4R_2 \int_0^{T^{*}} \alpha(T)\, dT \sim 4R_2\, \alpha(T^{*})\, T^{*}_{\mathrm{MeV}}. \tag{3.163} $$
T 


The coefficient $A$ depends only weakly on $\eta_{10}$; it increases by a factor of 2 as
$\eta_{10}$ goes from 1 to $10^{2}$. Taking as an estimate $A \simeq 0.1$, we find good
agreement with the results of the numerical calculations shown in Figure 3.8.
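
The closed form (3.161) can be checked against a direct numerical integration of (3.160). The sketch below assumes a constant $\alpha$ (the text lets it vary through $K(T)$) and uses illustrative starting values; the function names and the chosen numbers are assumptions made for this example, not values taken from the text.

```python
import math

def integrate_deuterium(x_start, t_start, eta10, alpha, r2, steps=20000):
    """Integrate dX_D/dT = 2*alpha*eta10*(X_D**2 + 2*r2*X_D), eq. (3.160),
    from T = t_start (MeV) down to T = 0 with a classical RK4 scheme."""
    def rhs(x):
        return 2.0 * alpha * eta10 * (x * x + 2.0 * r2 * x)

    h = -t_start / steps          # negative step: the temperature decreases
    x = x_start
    for _ in range(steps):
        k1 = rhs(x)
        k2 = rhs(x + 0.5 * h * k1)
        k3 = rhs(x + 0.5 * h * k2)
        k4 = rhs(x + h * k3)
        x += h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return x

def closed_form_deuterium(x_start, t_start, eta10, alpha, r2):
    """Eq. (3.161) evaluated at T = 0, assuming alpha is constant."""
    growth = math.exp(4.0 * r2 * eta10 * alpha * t_start)
    return 2.0 * r2 / ((1.0 + 2.0 * r2 / x_start) * growth - 1.0)

if __name__ == "__main__":
    # Illustrative inputs (assumed): X_D(T*) ~ R_1 ~ 5e-3, T* = 0.06 MeV, K = 0.7.
    args = dict(x_start=5e-3, t_start=0.06, eta10=1.0,
                alpha=0.86e5 * 0.7, r2=2.3e-5)
    print(integrate_deuterium(**args), closed_form_deuterium(**args))
```

The two numbers agree to many digits, which confirms that (3.161) is the exact integral of (3.160) when $R_2$ and $\alpha$ are held fixed.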

For $\eta_{10} < 1/A \simeq 10$, (3.162) simplifies to

$$ X_D^f \simeq \frac{2R_2}{A\eta_{10}} \simeq 4 \times 10^{-4}\, \eta_{10}^{-1}. \tag{3.164} $$

For this range of $\eta_{10}$ the deuterium freeze-out abundance decreases in inverse
proportion to $\eta_{10}$. This dependence on $\eta_{10}$ can easily be understood. For
$\eta_{10} < 10$, the freeze-out concentration $X_D^f$ is larger than $R_2 \simeq 2 \times 10^{-5}$
and, according to (3.160), DD reactions dominate in destroying deuterium. The deuterium
freeze-out is then determined by the condition
$\dot{X}_D / X_D \simeq \lambda_{DD} X_D^f \sim t^{-1}$. Since
$\lambda_{DD} \propto n_N \propto \eta_{10}$, we find that, to leading order,
$X_D^f \propto \eta_{10}^{-1}$.

For $\eta_{10} > 10$, (3.162) becomes

$$ X_D^f \simeq 2R_2 \exp(-A\eta_{10}). \tag{3.165} $$

In this case the deuterium abundance decays exponentially with $\eta_{10}$ and decreases
by five orders of magnitude, from $10^{-5}$ to $10^{-10}$, when $\eta_{10}$ changes from 10 to
100 (Figure 3.8). In a universe with high baryon density, the reaction
$\mathrm{D}p \to {}^3\mathrm{He}\,\gamma$ dominates in the destruction of deuterium when
$X_D < R_2 \simeq 2 \times 10^{-5}$. Hence, the freeze-out concentration is determined by the
term linear in $X_D$ in (3.160).
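
The two limiting forms (3.164) and (3.165) can be compared with the full expression (3.162) numerically. The sketch below holds $A = 0.1$ and $R_2 = 2.3 \times 10^{-5}$ fixed, which is an assumption: in the text $A$ grows slowly with $\eta_{10}$, so the high-density numbers produced here are only rough. The function names are illustrative.

```python
import math

R2 = 2.3e-5   # R_2 from (3.153); treated as a constant here
A = 0.1       # rough estimate of the coefficient A used in the text

def deuterium_freeze_out(eta10, a=A, r2=R2):
    """Deuterium freeze-out abundance, eq. (3.162)."""
    return 2.0 * r2 / math.expm1(a * eta10)

def low_density_limit(eta10, a=A, r2=R2):
    """Eq. (3.164): intended for eta10 < 1/A."""
    return 2.0 * r2 / (a * eta10)

def high_density_limit(eta10, a=A, r2=R2):
    """Eq. (3.165): intended for eta10 > 1/A."""
    return 2.0 * r2 * math.exp(-a * eta10)

if __name__ == "__main__":
    for eta10 in (1.0, 3.0, 10.0, 30.0, 100.0):
        print(eta10, deuterium_freeze_out(eta10),
              low_density_limit(eta10), high_density_limit(eta10))
```

For small `eta10` the full result tracks the inverse-proportional limit, and for large `eta10` it tracks the exponential one; near `eta10 = 1/A` neither limit is accurate.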

Thus, deuterium turns out to be an extremely sensitive indicator of the baryon
density in the universe. The observational data certainly rule out the possibility of
a flat universe composed only of baryonic matter.


3.5.5 The other light elements


Now we can calculate the final abundances of the other light elements by simply
using the quasi-equilibrium conditions.


Helium-3 The expression for the quasi-equilibrium concentration of $^3$He follows
from (3.148):

$$ X_{^3\mathrm{He}} \simeq \frac{3}{2}\left(\frac{\lambda_{DD1}}{\lambda_{^3\mathrm{He}\,D}} X_D + 2\frac{\lambda_{Dp}}{\lambda_{^3\mathrm{He}\,D}} X_p\right)\left(1 + 2\frac{\lambda_{^3\mathrm{He}\,n}}{\lambda_{^3\mathrm{He}\,D}}\frac{X_n}{X_D}\right)^{-1}. \tag{3.166} $$

If the baryon density is not too large, the rate of the dominant reaction in which $^3$He is
destroyed is larger than the rate of the deuterium destruction. Therefore, the freeze-out
of $^3$He occurs a little bit later than the freeze-out of deuterium. After deuterium
freeze-out, a small leakage from the D reservoir to the $^3$He reservoir still maintains
a stationary flow through the $^3$He reservoir and the quasi-equilibrium condition for
$^3$He is roughly satisfied at the time of its freeze-out. Substituting
$X_n \simeq X_D^2/R_1$ into (3.166) and the experimental values for the ratios of the
corresponding reaction rates, taken for definiteness at $T \simeq 0.06$ MeV, we obtain

$$ X^f_{^3\mathrm{He}} \simeq \frac{0.2\, X_D^f + 10^{-5}}{1 + 4 \times 10^{3}\, X_D^f}, \tag{3.167} $$

where $X_D^f$ is given in (3.162). This result is in good agreement with the numerical
calculations shown in Figure 3.8. For example, for $\eta_{10} = 1$, we have
$X_D^f \simeq 4 \times 10^{-4}$ and $X^f_{^3\mathrm{He}} \simeq 3 \times 10^{-5}$, that is, the
final $^3$He abundance is 10 times smaller than the deuterium abundance.
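
The estimate (3.167) is simple enough to evaluate by hand, but a short function makes the two regimes easy to see. This is an illustrative sketch; the function name is not from the text and the coefficients are those quoted in (3.167).

```python
def helium3_freeze_out(x_d_final):
    """Final helium-3 abundance by weight from the deuterium freeze-out
    abundance, eq. (3.167)."""
    return (0.2 * x_d_final + 1e-5) / (1.0 + 4e3 * x_d_final)

if __name__ == "__main__":
    for x_d in (4e-4, 1e-4, 1e-5, 1e-7):
        print(x_d, helium3_freeze_out(x_d))
```

For $X_D^f = 4 \times 10^{-4}$ the function returns a value between $3 \times 10^{-5}$ and $4 \times 10^{-5}$, consistent with the example in the text, and as $X_D^f \to 0$ it tends to the constant $10^{-5}$.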

The difference between $X_D^f$ and $X^f_{^3\mathrm{He}}$ decreases for larger $\eta_{10}$. For
$\eta_{10} \simeq 10$, the freeze-out concentrations of the deuterium and helium-3 are about
the same and equal to $10^{-5}$. In a universe with $\eta_{10} > 10$, the reaction
$\mathrm{D}p \to {}^3\mathrm{He}\,\gamma$ dominates in producing $^3$He around the freeze-out
time and nearly all deuterium is destroyed in favor of $^3$He, which thus becomes more
abundant than deuterium. In this case, the freeze-out of $^3$He is determined by two
competing reactions, $\mathrm{D}p \to {}^3\mathrm{He}\,\gamma$ and
${}^3\mathrm{He}\,\mathrm{D} \to {}^4\mathrm{He}\,p$, and, irrespective of how large $X_D$ is,
they give rise to the final $^3$He abundance,
$X^f_{^3\mathrm{He}} \sim \lambda_{Dp}/\lambda_{^3\mathrm{He}\,D} \sim 10^{-5}$. The weak
dependence of $X^f_{^3\mathrm{He}}$ on the baryon density for $\eta_{10} > 10$ is due to the
temperature dependence of the reaction rates, which we have ignored.


Tritium The quasi-equilibrium condition (3.147) gives

$$ X_T = \left(\frac{3}{2}\frac{\lambda_{DD2}}{\lambda_{DT}} + 2\frac{\lambda_{^3\mathrm{He}\,n}}{\lambda_{DT}}\frac{X_n}{X_D^2}\, X_{^3\mathrm{He}}\right) X_D. \tag{3.168} $$

Assuming that tritium freeze-out occurs at about the same time as for deuterium,
and substituting $X_n \simeq X_D^2/R_1$ into (3.168), we find

$$ X_T^f \simeq \left(0.015 + 3 \times 10^{2}\, X^f_{^3\mathrm{He}}\right) X_D^f, \tag{3.169} $$

where the experimental values $\lambda_{DD2}/\lambda_{DT} \simeq 0.01$ and
$\lambda_{^3\mathrm{He}\,n}/\lambda_{DT} \simeq 1$ have been used. For $\eta_{10} \simeq 1$, we
have $X_T^f \simeq 10^{-5}$. Note that, for any $\eta_{10}$, the tritium final
abundance is several times smaller than the deuterium abundance.
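
As a check of the arithmetic behind the quoted value, (3.169) can be evaluated with the $\eta_{10} \simeq 1$ inputs given in the text. The function name below is illustrative and the sketch only restates (3.169).

```python
def tritium_freeze_out(x_d_final, x_he3_final):
    """Final tritium abundance by weight, eq. (3.169)."""
    return (0.015 + 3e2 * x_he3_final) * x_d_final

if __name__ == "__main__":
    # Inputs quoted in the text for eta10 ~ 1.
    print(tritium_freeze_out(4e-4, 3e-5))
```

With $X_D^f = 4 \times 10^{-4}$ and $X^f_{^3\mathrm{He}} = 3 \times 10^{-5}$ the two terms in the bracket are 0.015 and 0.009, so the product is close to $10^{-5}$, as stated.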


Problem 3.21 When does tritium freeze-out take place? For which $\eta_{10}$ can we use
$X_D^f$ in (3.168) to estimate $X_T^f$? Which value of $X_D$ should be used otherwise?


Problem 3.22 Explain why the $^3$He concentration increases monotonically in time
(see Figure 3.7) but the tritium concentration first rises to a maximum and then
decreases until it freezes out.


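Lithium-7 and beryllium-7 The quasi-equilibrium conditions for $^7$Li and $^7$Be result
from the dominant reactions in which $^7$Li and $^7$Be are produced and destroyed (see
Figure 3.6):

$$ \frac{7}{12}\lambda_{^4\mathrm{He}\,T} X_{^4\mathrm{He}} X_T + \lambda_{^7\mathrm{Be}\,n} X_{^7\mathrm{Be}} X_n = \lambda_{^7\mathrm{Li}\,p} X_{^7\mathrm{Li}} X_p, \tag{3.170} $$

$$ \frac{7}{12}\lambda_{^4\mathrm{He}\,^3\mathrm{He}} X_{^4\mathrm{He}} X_{^3\mathrm{He}} = \lambda_{^7\mathrm{Be}\,n} X_{^7\mathrm{Be}} X_n. \tag{3.171} $$

One can check that other reactions, such as
${}^7\mathrm{Li} + \mathrm{D} \to 2\,{}^4\mathrm{He} + n$ and
${}^7\mathrm{Be} + \mathrm{D} \to 2\,{}^4\mathrm{He} + p$, can be ignored for $\eta_{10} > 1$.
It follows from these equations that

$$ X_{^7\mathrm{Li}} = \frac{7}{12}\frac{X_{^4\mathrm{He}}}{X_p}\frac{\lambda_{^4\mathrm{He}\,T}}{\lambda_{^7\mathrm{Li}\,p}}\left(X_T + \frac{\lambda_{^4\mathrm{He}\,^3\mathrm{He}}}{\lambda_{^4\mathrm{He}\,T}}\, X_{^3\mathrm{He}}\right). \tag{3.172} $$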

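The ratio $\lambda_{^4\mathrm{He}\,T}/\lambda_{^7\mathrm{Li}\,p}$ is nearly constant over a broad
temperature interval, increasing only from $2.2 \times 10^{-3}$ to $3 \times 10^{-3}$ as the
temperature drops from 0.09 MeV to 0.03 MeV, while

$$ r(T) \equiv \frac{\lambda_{^4\mathrm{He}\,^3\mathrm{He}}}{\lambda_{^4\mathrm{He}\,T}} $$

changes significantly over the same temperature interval, namely, $r \simeq 5 \times 10^{-2}$
for $T \simeq 0.09$ MeV and $r \simeq 6 \times 10^{-3}$ for $T \simeq 0.03$ MeV. With these values
of the reaction rates we obtain

$$ X_{^7\mathrm{Li}} \sim 10^{-4}\left(X_T + r(T)\, X_{^3\mathrm{He}}\right). \tag{3.173} $$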


To estimate the freeze-out concentration for $^7$Li we must know the values of $X_T$,
$r(T)$ and $X_{^3\mathrm{He}}$ at $^7$Li freeze-out. For $5 > \eta_{10} > 1$, freeze-out occurs
after the deuterium reaches its final abundance, and we can substitute into (3.173) the
values of $X_T^f$ and $X^f_{^3\mathrm{He}}$ obtained previously. For $\eta_{10} \simeq 1$, the first
term on the right hand side in (3.173) dominates and, using the estimate
$X_T^f \simeq 10^{-5}$, we obtain $X^f_{^7\mathrm{Li}} \simeq 10^{-9}$. For larger $\eta_{10}$, the
tritium abundance $X_T^f$ is smaller and, consequently, $X^f_{^7\mathrm{Li}}$ decreases as
$\eta_{10}$ grows, but only until the second term on the right hand side in (3.173) starts
to dominate. The minimum final $^7$Li abundance, $X^f_{^7\mathrm{Li}} \simeq 10^{-10}$, is
reached for $\eta_{10}$ between 2 and 3 (see Figure 3.8). Then, further increase in
$\eta_{10}$ causes the $^7$Li abundance to rise. This rise is mostly due to the temperature
dependence of $r$; for $\eta_{10} > 3$, the freeze-out temperature is determined by the
efficiency of the $^7$Be$n$ reaction, which in turn depends on the neutron concentration.
In a universe with high baryon density, the deuterium and free neutrons burn more
efficiently and disappear earlier (at a higher temperature) than in a universe with low
baryon density. Therefore, the $^7$Li concentration freezes out at a higher temperature
at which $r$ is larger. Note also that, for $\eta_{10} > 5$, the $^7$Be$n$ reaction becomes
inefficient before $^3$He reaches its freeze-out concentration, and hence, to estimate
$X^f_{^7\mathrm{Li}}$ properly we have to substitute in (3.173) the actual value of
$X_{^3\mathrm{He}}$ at $^7$Li freeze-out, which is larger than $X^f_{^3\mathrm{He}}$. Numerical
calculations show that after passing through a relatively deep minimum with
$X^f_{^7\mathrm{Li}} \simeq 10^{-10}$, the lithium concentration comes back to $10^{-9}$ at
$\eta_{10} \simeq 10$.

In summary, the trough in the $X^f_{^7\mathrm{Li}}$ versus $\eta_{10}$ curve is due to the
competition of two reactions. In a universe with $\eta_{10} < 3$, most of the $^7$Li is
produced directly in the $^4$HeT reaction. For $\eta_{10} > 3$, the reaction $^7$Be$n$ is more
important and $^7$Li is produced mainly through the intermediate $^7$Be reservoir.

Beryllium-7 is not so important from the observational point of view, so, simply
to gain a feeling for its abundance, we estimate it in the range $5 > \eta_{10} > 1$, in which
$^7$Be freeze-out occurs after that of deuterium. The quasi-equilibrium solution for
free neutrons is valid at this time and, substituting $X_n \simeq (X_D^f)^2/R_1$ into (3.171),
we find

$$ X^f_{^7\mathrm{Be}} \simeq \frac{7}{12}\, R_1 \frac{\lambda_{^4\mathrm{He}\,^3\mathrm{He}}}{\lambda_{^7\mathrm{Be}\,n}}\, X_{^4\mathrm{He}} \frac{X^f_{^3\mathrm{He}}}{(X_D^f)^2} \sim 10^{-12}\, \frac{X^f_{^3\mathrm{He}}}{(X_D^f)^2}, \tag{3.174} $$

where the experimental values for the ratios of the relevant reactions have been used.
In this case, the product of the corresponding ratios changes by a factor of 5 over
the relevant temperature interval, so (3.174) is merely an estimate. For $\eta_{10} = 1$,
we have $X_D^f \simeq 4 \times 10^{-4}$, $X^f_{^3\mathrm{He}} \simeq 3 \times 10^{-5}$ and, hence,
$X^f_{^7\mathrm{Be}} \simeq 2.5 \times 10^{-10}$.

The observed light element abundances are in very good agreement with theoretical
predictions, thus lending strong support to the standard cosmological model.
Observations suggest that $7 > \eta_{10} > 3$ at 95% confidence level.


3.6 Recombination


The most important matter ingredients in thermodynamical processes after
nucleosynthesis are thermal radiation, electrons, protons $p$ (hydrogen nuclei) and fully
ionized helium nuclei, He$^{2+}$. The concentrations of the other light elements are very
small and we neglect them here. As the temperature decreases, the ionized helium
and hydrogen nuclei begin to capture the available free electrons and become
electrically neutral. In a short period of time, nearly all free electrons and nuclei have
combined to form neutral atoms and the universe becomes transparent to radiation.
Since this process occurs so quickly, we refer to this epoch as the recombination
moment.

We must, however, distinguish the helium and hydrogen recombinations, be- 
cause they happen at different times. Helium has significantly larger ionization 
potentials than hydrogen and therefore becomes neutral earlier. However, after he- 
lium recombination, many free electrons remain and the universe is still opaque to 
radiation. Only after hydrogen recombination have most photons decoupled from 
matter; these are the photons that give us a “baby photo” of the universe. As a 
result, hydrogen recombination is a more interesting and dramatic event from an 
observational point of view. 

Helium recombination, nevertheless, has some cosmological relevance. When 
helium becomes neutral it decouples from the plasma thus altering the speed of 
sound in the radiation—baryon fluid. We will see in Chapter 9 that this speed influ- 
ences the CMB temperature fluctuations. 

Recombination is not an equilibrium process. Hence, the formulae derived under 
the assumption of local equilibrium can only be used to estimate when recombina- 
tion occurs. This is sufficient when we consider helium recombination. However, the 
subtleties of hydrogen recombination are very important for the calculation of the 
CMB temperature fluctuations. Therefore, after estimating the hydrogen recombi- 
nation temperature based on the equilibrium equations, we will use kinetic theory 
to reveal the details of nonequilibrium recombination. 


3.6.1 Helium recombination 


The electric charge of the helium nucleus is 2, so it must capture two electrons to
become neutral. This occurs in two steps. First, the helium captures one electron,
becoming a singly charged, hydrogen-like ion He$^{+}$. The binding energy of this ion
is four times larger than the binding energy for hydrogen:

$$ B_{+} = m_e + m_{2+} - m_{+} = 54.4\ \mathrm{eV}, \tag{3.175} $$

where $m_{2+}$ and $m_{+}$ are the masses of He$^{2+}$ and He$^{+}$ respectively. This energy
corresponds to a temperature of 632 000 K. To estimate the temperature at which
most of the helium nuclei are converted into helium ions, we assume that the
reaction

$$ \mathrm{He}^{2+} + e^{-} \rightleftharpoons \mathrm{He}^{+} + \gamma \tag{3.176} $$

is efficient in maintaining the chemical equilibrium between He$^{2+}$ and He$^{+}$. Then,
the chemical potentials satisfy

$$ \mu_{2+} + \mu_e = \mu_{+}, \tag{3.177} $$

and considering the ratio $(n_{2+} n_e)/n_{+}$, where the number densities are given by
(3.61), we obtain the Saha formula:

$$ \frac{n_{2+}\, n_e}{n_{+}} = \frac{g_{2+}\, g_e}{g_{+}}\left(\frac{T m_e}{2\pi}\right)^{3/2} \exp\left(-\frac{B_{+}}{T}\right). \tag{3.178} $$

The ratio of the statistical weights here is equal to unity. Even complete
recombination of helium reduces the number of free electrons by 12% at most. Therefore,
before hydrogen recombination, the number density of free electrons is

$$ n_e \simeq (0.75 \text{ to } 0.88)\, n_N \simeq 2 \times 10^{-11}\, \eta_{10}\, T^{3}. \tag{3.179} $$

Substituting this into (3.178), we obtain

$$ \frac{n_{2+}}{n_{+}} \simeq \exp\left(35.6 + \frac{3}{2}\ln\left(\frac{B_{+}}{T}\right) - \frac{B_{+}}{T} - \ln \eta_{10}\right). \tag{3.180} $$





If the expression in the exponent is positive, the concentration of He$^{+}$ ions is small
compared to the concentration of completely ionized helium. Using the method of
iteration, we find that at the temperature

$$ T_{+} \simeq \frac{B_{+}}{42 - \ln \eta_{10}} \simeq 15\,000 \times \left(1 + 2.3 \times 10^{-2} \ln \eta_{10}\right)\ \mathrm{K}, \tag{3.181} $$

the ratio $n_{2+}/n_{+}$ is of order unity. At this time, the He$^{+}$ ions constitute about 50% of
all helium and the rest is completely ionized. Very soon after this, nearly all helium
nuclei capture an electron and are converted into He$^{+}$. Expanding the expression in
the exponent in (3.180) about $T = T_{+}$, we find, to leading order in
$\Delta T \equiv T_{+} - T \ll T_{+}$,

$$ \frac{n_{2+}}{n_{+}} \simeq \exp\left(-\frac{B_{+}}{T_{+}}\frac{\Delta T}{T_{+}}\right) \simeq \exp\left(-42\,\frac{\Delta T}{T_{+}}\right). \tag{3.182} $$

When the temperature falls only 20% below $T_{+}$ (going from 15 000 K to 12 000 K),
the number density of He$^{2+}$ reduces to $n_{2+} \simeq 10^{-4} n_{+}$. We see in (3.181) that the
temperature $T_{+}$ varies logarithmically with the baryon density $\eta_{10}$; the larger the
baryon density, the earlier recombination occurs (i.e., at a higher temperature).

After most of the helium is converted into He$^{+}$ ions, the singly charged ions
capture a second electron and become neutral. The second electron also ends up in
the first orbit. The electron-electron interaction substantially reduces the binding
energy: it is only 24.62 eV for the second electron. Therefore, the second stage of
helium recombination occurs at a lower temperature than the first. For example,
at $T \simeq 12\,000$ K, the density of neutral helium atoms is still negligible; only after
the temperature decreases below $T \simeq 5000$ K does helium become neutral and
decouple from radiation. At this time hydrogen is still fully ionized and the universe
remains opaque to radiation.
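
The "method of iteration" used for the first stage of helium recombination can be made concrete. The sketch below solves for the temperature at which the exponent in (3.180) vanishes by fixed-point iteration and compares it with the approximate expression (3.181). The function names and the starting guess are assumptions for this illustration; the constants are those quoted in the text.

```python
import math

B_PLUS_KELVIN = 632_000.0  # He+ binding energy expressed as a temperature (from the text)

def helium_first_recombination_temperature(eta10, iterations=50):
    """Solve 35.6 + 1.5*ln(x) - x - ln(eta10) = 0 for x = B_+/T by fixed-point
    iteration, i.e. find where the exponent in (3.180) vanishes."""
    x = 40.0  # starting guess for B_+/T
    for _ in range(iterations):
        x = 35.6 + 1.5 * math.log(x) - math.log(eta10)
    return B_PLUS_KELVIN / x

def helium_first_recombination_fit(eta10):
    """Approximate expression (3.181), in kelvin."""
    return 15_000.0 * (1.0 + 2.3e-2 * math.log(eta10))

if __name__ == "__main__":
    for eta10 in (1.0, 5.0, 10.0):
        print(eta10,
              helium_first_recombination_temperature(eta10),
              helium_first_recombination_fit(eta10))
```

The iteration converges quickly because the logarithm varies slowly; the iterated temperature and the rounded fit (3.181) agree to within a few per cent over the range $1 \le \eta_{10} \le 10$.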


Problem 3.23 Assuming chemical equilibrium, derive the expression for the ratio
of the number densities of He$^{+}$ and neutral He. Verify that for $\eta_{10} \simeq 5$ this ratio is
equal to unity at $T \simeq 6800$ K and is about $10^{-4}$ at $T \simeq 5600$ K.


Problem 3.24 Explain why the recombination temperature is significantly smaller 
than the corresponding ionization potential energies. 


3.6.2 Hydrogen recombination: equilibrium consideration


The main reaction responsible for maintaining hydrogen and radiation in equilibrium
is

$$ p + e^{-} \rightleftharpoons \mathrm{H} + \gamma, \tag{3.183} $$

where H is a neutral hydrogen atom. For the ground (1S) state, the binding energy
of neutral hydrogen,

$$ B_H = m_p + m_e - m_H = 13.6\ \mathrm{eV}, \tag{3.184} $$

corresponds to a temperature of 158 000 K. In this case, the Saha formula can be
derived in the same manner as (3.178) and takes the form

$$ \frac{n_p\, n_e}{n_H} = \left(\frac{T m_e}{2\pi}\right)^{3/2} \exp\left(-\frac{B_H}{T}\right), \tag{3.185} $$


where $n_H$ is the number density of the hydrogen atoms in the ground state and we
have taken into account that the corresponding ratio of the statistical weights $g_i$
is equal to unity. At equilibrium, neutral hydrogen atoms are also present in the
excited states: 2S, 2P, .... However, at $T < 5000$ K, their relative concentrations
are negligible: for example,

$$ \frac{n_{2P}}{n_{1S}} = \frac{g_{2P}}{g_{1S}} \exp\left(-\frac{3B_H}{4T}\right) < 10^{-9}. \tag{3.186} $$

Therefore, for now, we neglect the excited hydrogen atoms and introduce the
ionization fraction:

$$ X_e \equiv \frac{n_e}{n_e + n_H}. \tag{3.187} $$

Since $n_p = n_e$ and

$$ n_e + n_H \simeq 0.75 \times 10^{-10}\, \eta_{10}\, n_\gamma \simeq 3.1 \times 10^{-8}\, \eta_{10} \left(\frac{T}{T_{\gamma 0}}\right)^{3}\ \mathrm{cm}^{-3}, \tag{3.188} $$
(3.185) becomes

$$ \frac{X_e^2}{1 - X_e} = \exp\left(37.7 + \frac{3}{2}\ln\left(\frac{B_H}{T}\right) - \frac{B_H}{T} - \ln \eta_{10}\right). \tag{3.189} $$

The expression in the exponent vanishes at

$$ T_{\mathrm{rec}} \simeq \frac{B_H}{43.4 - \ln \eta_{10}} \simeq 3650\left(1 + 2.3 \times 10^{-2} \ln \eta_{10}\right)\ \mathrm{K}. \tag{3.190} $$

At this time, the ionization fraction is $X_e \simeq 0.6$ and the temperature $T_{\mathrm{rec}}$
characterizes the moment when hydrogen recombination begins. At earlier times
$X_e \to 1$; for example, $1 - X_e \simeq 10^{-5}$ at $T \simeq 5000$ K. As soon as the temperature
decreases below $T_{\mathrm{rec}}$, recombination proceeds very rapidly. According to (3.189),
a 10% fall in temperature reduces the ionization fraction by a factor of 10; at
$T \simeq 2500$ K, we have $X_e \simeq 10^{-4}$.
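
The equilibrium ionization fraction defined by (3.189) is the positive root of a quadratic, so it can be tabulated directly. The following sketch is illustrative only: the function names are assumptions, the numerical constants are those quoted in the text, and the root is written in a form that avoids cancellation when the right hand side is large.

```python
import math

B_H_KELVIN = 158_000.0  # hydrogen binding energy as a temperature (from the text)

def saha_exponent(temperature_k, eta10):
    """Exponent on the right hand side of (3.189)."""
    x = B_H_KELVIN / temperature_k
    return 37.7 + 1.5 * math.log(x) - x - math.log(eta10)

def equilibrium_ionization_fraction(temperature_k, eta10):
    """Solve X_e**2 / (1 - X_e) = exp(exponent) for X_e in [0, 1)."""
    s = math.exp(saha_exponent(temperature_k, eta10))
    if s == 0.0:  # exponent underflowed: effectively fully neutral
        return 0.0
    # Positive root of X**2 + s*X - s = 0, rewritten as 2 / (1 + sqrt(1 + 4/s)).
    return 2.0 / (1.0 + math.sqrt(1.0 + 4.0 / s))

if __name__ == "__main__":
    for temperature in (5000.0, 4000.0, 3785.0, 3500.0, 3000.0, 2500.0):
        print(temperature, equilibrium_ionization_fraction(temperature, 5.0))
```

Near the temperature given by (3.190) the function returns a value close to 0.6, and it falls steeply below that temperature. These are equilibrium values only, so they say nothing about the freeze-out of the ionization fraction in an expanding universe.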

The equilibrium Saha formula tells us that the ionization fraction should con- 
tinue to decrease exponentially as the temperature drops. However, this does not 
occur in an expanding universe; the ionization fraction freezes out instead. More 
importantly, the equilibrium description fails almost immediately after the begin- 
ning of recombination. The main reason for the failure of the Saha formula is the 
large number of energetic photons emitted when the nuclei and electrons combine. 
These nonthermal photons significantly distort the high-energy tail of the thermal 
radiation spectrum exactly at energies crucial to recombination. As a result, it be- 
comes essential to take into account deviation from equilibrium and we must use 
kinetic theory. 


3.6.3 Hydrogen recombination: the kinetic approach 


Direct recombination to the ground state accompanied by the emission of a photon
does not substantially increase the number of neutral atoms because the emitted
photon has enough energy to immediately ionize the first neutral hydrogen atom
it meets. These two competing processes occur at very high rates and result in
no net change in $n_H$. More efficient is cascading recombination, in which neutral
hydrogen is first produced in an excited state and then decays to the ground state in
a sequence of steps. However, even in cascading recombination, at least one very
energetic photon is emitted corresponding to the energy difference between the 2P
and 1S states of the hydrogen atom. This so-called Lyman-$\alpha$ photon, $L_\alpha$, has energy
$3B_H/4 = 117\,000$ K and a rather large resonance absorption cross-section,
$\sigma_\alpha \simeq 10^{-17}$-$10^{-16}$ cm$^2$, at the recombination temperature. Therefore, the
$L_\alpha$ photons are reabsorbed in a short time,
$\tau_\alpha \simeq (\sigma_\alpha n_H)^{-1} \simeq 10^{3}$-$10^{4}$ s, after emission. We have to
compare this time to the cosmological time at recombination. During the
matter-dominated epoch, the cosmological time can easily be expressed in terms of the
radiation temperature by equating the energy density of cold particles (see (3.1))
with the critical density $\varepsilon^{cr} = 1/(6\pi t^2)$ and noting that
$T = T_{\gamma 0}(1 + z)$. We obtain

$$ t_{\mathrm{sec}} \simeq 2.75 \times 10^{17} \left(\Omega_m h_{75}^2\right)^{-1/2} \left(\frac{T_{\gamma 0}}{T}\right)^{3/2}. \tag{3.191} $$

At the moment of recombination, $\tau_\alpha \ll t \simeq 10^{13}$ s and, consequently, the
$L_\alpha$ photons are not significantly redshifted before reabsorption. To simplify our
considerations we will neglect the redshift effect.
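
To see the size of the cosmological time at recombination, (3.191) can be evaluated for representative inputs. The default values $\Omega_m h_{75}^2 = 0.3$ and $T_{\gamma 0} = 2.725$ K used below are assumptions chosen for illustration, not values fixed by this section; the function name is likewise illustrative.

```python
def cosmic_time_sec(temperature_k, omega_m_h75_sq=0.3, t_gamma0_k=2.725):
    """Cosmological time (s) in the matter-dominated epoch, eq. (3.191).

    omega_m_h75_sq and t_gamma0_k are assumed illustrative defaults.
    """
    return 2.75e17 * omega_m_h75_sq ** -0.5 * (t_gamma0_k / temperature_k) ** 1.5

if __name__ == "__main__":
    for temperature in (3650.0, 3000.0, 2500.0):
        print(temperature, cosmic_time_sec(temperature))
```

For temperatures of a few thousand kelvin the result is of order $10^{13}$ s, many orders of magnitude longer than the reabsorption time of $10^{3}$ to $10^{4}$ s quoted in the text.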

The presence of a large number of $L_\alpha$ photons and other energetic photons
results in a greater abundance of electrons, protons and hydrogen atoms in excited
2S and 2P states than expected according to the equilibrium Saha formula. This
delays recombination so that, for a given temperature, the actual ionization fraction
exceeds its equilibrium value. The full system of kinetic equations describing
nonequilibrium recombination is rather complicated and is usually solved numerically.
To solve them analytically, we use the method of quasi-equilibrium concentrations,
as was applied to the problem of nucleosynthesis. The results obtained by this
method are in good agreement with the numerical calculations.

We will make a series of simplifying assumptions whose validity can be checked
a posteriori. First of all, we neglect all highly excited hydrogen states and retain
only the 1S, 2S and 2P states of neutral hydrogen. The remaining ingredients are
electrons, protons and thermal photons, as well as $L_\alpha$ and other nonthermal photons
emitted during recombination. The main reactions in which these components
participate are symbolically represented in Figure 3.9. The direct recombination to
the ground state can be ignored because it results in no net change of neutral
hydrogen, as explained above. Thermal radiation dominates in ionizing the 2S and 2P
states. In fact, to ionize an excited atom, the energy of a photon must only be larger
than $B_H/4$. The number of such thermal photons is much greater than the number
of energetic nonthermal photons, and hence, when considering the ionization of
excited atoms, we can ignore the distortion of the thermal radiation spectrum.

In contrast, thermal photons play no significant role in transitions between 1S
and 2P states after the beginning of recombination. These transitions are mostly due
to the nonthermal $L_\alpha$ photons. When the deviation from equilibrium becomes large
and the 2S level is overpopulated, we can ignore the transition
$1S + \gamma + \gamma \to 2S$ compared to the two-photon decay $2S \to 1S + \gamma + \gamma$.
(A transition with a single photon is forbidden by angular momentum conservation.)
The rate for the two-photon decay, $W_{2S \to 1S} \simeq 8.23$ s$^{-1}$, is very small (compare,
for example, with $W_{2P \to 1S} \simeq 4 \times 10^{8}$ s$^{-1}$); nevertheless, this decay plays the
dominant role in nonequilibrium recombination. In terms of the pipe-and-reservoir
picture, the two-photon transition is the main source of irreversible leakage from the
$e, p$ reservoir to the 1S reservoir. Because all other processes result in high-energy
photons which reionize neutral hydrogen and return electrons to the $e, p$ reservoir, the
rate of net change in the ionization fraction is

$$ \frac{dX_e}{dt} = -\frac{dX_{1S}}{dt} = -W_{2S \to 1S}\, X_{2S}, \tag{3.192} $$

where $X_e \equiv n_e/n_T$, $X_{2S} \equiv n_{2S}/n_T$, and $n_T$ is the total number density of
neutral atoms plus electrons, given in (3.188). Once a substantial fraction (~50%) of
neutral hydrogen has formed, (3.192) is a good approximation to use until nearly the end
of recombination.

To express $X_{2S}$ in terms of $X_e$, we use the quasi-equilibrium condition for the
intermediate 2S reservoir; this is justified by the high rate of the reactions shown
in Figure 3.9. For the 2S reservoir, this condition takes the following form:


(OD) op 98 NeNp 10 ) Soy ny'nos —_ Wos-+1sN25 = 0, (3.193) 


where ny is the number density of thermal photons. The relation between the cross- 
sections for the direct and inverse reactions, ep < y25S, can be found if one notes 
that in a state of equilibrium these reactions compensate each other. Then, we have 


(2) ,25-epMy _ nenp' (Time ae Bu 3.194 
=—q- = exp{| —-— }, (3.194) 
(OU psa bs Nog 20 4T 





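where the Saha formula has been used to obtain the latter equality (recall that the
binding energy of the $2S$ state is $B_H/4$). With the help of this relation, we can express
$X_{2S}$, from (3.193), as

$$X_{2S} = \left[\frac{W_{2S}}{\langle\sigma v\rangle_{ep\to\gamma 2S}} + \left(\frac{T m_e}{2\pi}\right)^{3/2} \exp\left(-\frac{B_H}{4T}\right)\right]^{-1} n_T X_e^2, \qquad (3.195)$$

and (3.192) becomes

$$\frac{dX_e}{dt} = -W_{2S}\left[\frac{W_{2S}}{\langle\sigma v\rangle_{ep\to\gamma 2S}} + \left(\frac{T m_e}{2\pi}\right)^{3/2} \exp\left(-\frac{B_H}{4T}\right)\right]^{-1} n_T X_e^2. \qquad (3.196)$$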


When the first term inside the square brackets is small compared to the second, 
the electrons and excited hydrogen atoms are in equilibrium with each other and 
with the thermal radiation. Therefore, the ratio of the e, p and 2S number densities 
satisfies a Saha-type relation (see the second equality in (3.194)). The ionization 
fraction, however, does not obey (3.189) because, as mentioned above, the ground 
state is not in equilibrium with the other levels after the beginning of recombination. 
The excited states are more abundant than one expects in full equilibrium and the 
ionization fraction significantly exceeds that given in (3.189). 











Problem 3.25 The cross-section for recombination to the $2S$ level is well
approximated by the formula

$$\langle\sigma v\rangle_{ep\to\gamma 2S} \simeq 6.3 \times 10^{-14} \left(\frac{B_H}{4T}\right)^{1/2} \text{cm}^3\,\text{s}^{-1}. \qquad (3.197)$$

Using this expression, verify that the two terms inside the square brackets in (3.196)
become comparable at the temperature $T \simeq 2450$ K.
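
As a rough numerical cross-check of Problem 3.25, the following sketch solves for the temperature at which the two terms in the square brackets of (3.196) are equal. The cgs constants, the value $B_H \simeq 13.6$ eV expressed in kelvin, the function names and the bisection bracket are assumptions of this illustration rather than part of the text; with units restored, $(T m_e/2\pi)^{3/2}$ becomes $(m_e k_B T/2\pi\hbar^2)^{3/2}$.

```python
import math

# Assumed physical constants in cgs units (not quoted in the text).
M_E = 9.109e-28        # electron mass, g
K_B = 1.381e-16        # Boltzmann constant, erg/K
HBAR = 1.055e-27       # reduced Planck constant, erg s
B_H_KELVIN = 1.578e5   # hydrogen binding energy (13.6 eV) expressed in K
W_2S = 8.23            # two-photon decay rate, 1/s (value from the text)


def sigma_v_2s(temp):
    """Recombination rate to the 2S level, formula (3.197), in cm^3/s."""
    return 6.3e-14 * math.sqrt(B_H_KELVIN / (4.0 * temp))


def decay_term(temp):
    """First bracket term of (3.196): W_2S / <sigma v>, in cm^-3."""
    return W_2S / sigma_v_2s(temp)


def thermal_term(temp):
    """Second bracket term: (m_e k T / 2 pi hbar^2)^(3/2) exp(-B_H/4T), in cm^-3."""
    prefactor = (M_E * K_B * temp / (2.0 * math.pi * HBAR ** 2)) ** 1.5
    return prefactor * math.exp(-B_H_KELVIN / (4.0 * temp))


def crossover_temperature(lo=1500.0, hi=5000.0, tol=1e-3):
    """Bisection for the temperature (in K) at which the two terms are equal."""
    def diff(t):
        return thermal_term(t) - decay_term(t)

    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if diff(lo) * diff(mid) <= 0.0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    print(f"terms are equal at T ~ {crossover_temperature():.0f} K")
```

Because the thermal term depends exponentially on $T$ while the decay term varies only as $T^{1/2}$, the crossover is sharp, and modest changes in the assumed constants shift it by only a few tens of kelvin.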


Hence, only at $T > 2450$ K is the reaction $\gamma 2S \to ep$ efficient in maintaining
chemical equilibrium between the electrons, protons and hydrogen $2S$ states. After
the temperature drops below $T \simeq 2450$ K, thermal radiation no longer plays an
essential role, and the quasi-equilibrium concentration of the $2S$ states is determined
by equating the rates for recombination to the $2S$ level and two-photon decay (see
(3.193), where the second term can be neglected). At $T < 2450$ K, the second term
in the square brackets in (3.196) can be neglected and (3.196) simplifies to

$$\frac{dX_e}{dt} \simeq -\langle\sigma v\rangle_{ep\to\gamma 2S}\, n_T X_e^2. \qquad (3.198)$$

Thus, in this regime, the rate of recombination is entirely determined by the rate
of recombination to the $2S$ level and does not depend on $W_{2S}$. At this time, the
number density of $L_\alpha$ photons is almost completely depleted due to two-photon
decays and the $2P$ states also drop out of equilibrium with electrons, protons and
thermal radiation. Consequently, nearly every recombination event to the $2P$ state
or any other excited state succeeds in producing a neutral hydrogen atom. This
effect becomes relevant only at late stages and can be incorporated in (3.198) and
(3.196) by replacing $\langle\sigma v\rangle_{ep\to\gamma 2S}$ with the cross-section for recombination to all
excited states. The latter is well approximated by the fitting formula

$$\langle\sigma v\rangle_{rec} \simeq 8.7 \times 10^{-14} \left(\frac{B_H}{4T}\right)^{0.8} \text{cm}^3\,\text{s}^{-1}. \qquad (3.199)$$


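It is convenient to rewrite the corrected (3.196) using the redshift parameter
$z = T/T_{\gamma 0} - 1$, instead of cosmological time (see (3.191)). After some elementary
algebra, we obtain

$$\frac{dX_e}{dz} \simeq 0.1\, \frac{\eta_{10}}{\sqrt{\Omega_m h_{75}^2}} \left[0.72\left(\frac{z}{14400}\right)^{0.3} + 10^{4} z \exp\left(-\frac{14400}{z}\right)\right]^{-1} X_e^2. \qquad (3.200)$$

This equation is readily integrated:

$$X_e(z) \simeq 6.9 \times 10^{-4}\, \frac{\sqrt{\Omega_m h_{75}^2}}{\eta_{10}} \left[\int_{z/14400}^{z_{in}/14400} \frac{dy}{0.72\, y^{0.3} + 1.44 \times 10^{8}\, y \exp(-1/y)}\right]^{-1}. \qquad (3.201)$$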


The solution $X_e(z)$ is not very sensitive to the initial conditions when
$X_e(z_{in}) \gg X_e(z)$ because the main contribution to the integral comes from $z < z_{in}$. For
$z > 900$, or equivalently, at $T > 2450$ K, the first term in the denominator of
the integrand can be neglected and expression (3.201) is well approximated by

$$X_e(z) \simeq 1.4 \times 10^{9}\, \frac{\sqrt{\Omega_m h_{75}^2}}{\eta_{10}}\, z^{-1} \exp\left(-\frac{14400}{z}\right). \qquad (3.202)$$

In this regime the rate of recombination is completely determined by the rate of
two-photon decay. Obviously, (3.201) and (3.202) are valid only after the ionization
fraction decreases significantly below unity and the deviation from
equilibrium becomes significant. Compared with numerical results, they become
sufficiently accurate after the concentration of neutral hydrogen has reached about
50% (Figure 3.10). According to (3.202), for realistic values of the cosmological
parameters ($\Omega_m h_{75}^2 \simeq 0.3$ and $\eta_{10} \simeq 5$), this occurs at $z \simeq 1220$ or, equivalently,
at $T \simeq 3400$ K. Hence, the range of applicability of (3.202) is not very wide,
namely, $1200 > z > 900$. During this time the temperature drops only
from 3400 K to 2450 K, but the ionization fraction decreases very substantially,
to $X_e(900) \simeq 2 \times 10^{-2}$. It is interesting to compare this result with the prediction
of the equilibrium Saha formula (3.189), according to which $X_e(2450\ \text{K}) \simeq 10^{-5}$.
Thus, at $z \simeq 900$, the actual ionization fraction is a thousand times larger than
the equilibrium one. It is also noteworthy that the equilibrium ionization fraction
is completely determined by the baryon density and the temperature but the
nonequilibrium $X_e(z)$, given in (3.201), also depends on the total density of
nonrelativistic matter. This is not surprising because nonrelativistic matter determines the
cosmological expansion rate, which is an important factor in the kinetic description
of nonequilibrium recombination.
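
To get a feeling for how steeply the ionization fraction falls in this regime, one can simply tabulate (3.202). The sketch below does this for the parameter values quoted above; the function name and the default arguments are choices made for this illustration only.

```python
import math


def x_e_two_photon(z, omega_m_h2=0.3, eta_10=5.0):
    """Ionization fraction from the approximate formula (3.202)."""
    prefactor = 1.4e9 * math.sqrt(omega_m_h2) / eta_10
    return prefactor / z * math.exp(-14400.0 / z)


if __name__ == "__main__":
    for z in (1100, 1000, 900):
        print(f"z = {z:5d}   X_e ~ {x_e_two_photon(z):.3g}")
```

For these parameters the formula gives roughly $2 \times 10^{-2}$ at $z = 900$, in line with the value quoted in the text; the exponential factor is responsible for nearly all of the variation.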


Problem 3.26 Compare nonequilibrium recombination with the predictions of the
Saha formula for various values of the cosmological parameters $\Omega_m h_{75}^2$ and $\eta_{10}$.
In which cases is the deviation from the Saha result large immediately after the
beginning of recombination?


At $z < 900$, when the temperature drops below 2450 K, the approximate formula
(3.202) is no longer valid and we should use (3.201). The ionization fraction
continues to drop at first and then freezes out. For example, (3.201) predicts
$X_e(z = 800) \simeq 5 \times 10^{-3}$, $X_e(400) \simeq 7 \times 10^{-4}$ and $X_e(100) \simeq 4 \times 10^{-4}$, for
$\Omega_m h_{75}^2 \simeq 0.3$ and $\eta_{10} \simeq 5$. To calculate the freeze-out concentration, we note that
the integral in (3.201) converges to 0.27 as $z$ goes to zero; hence,

$$X_e^{f} \simeq 2.5 \times 10^{-3}\, \frac{\sqrt{\Omega_m h_{75}^2}}{\eta_{10}} \simeq 1.6 \times 10^{-5}\, \frac{\sqrt{\Omega_m}}{\Omega_b h_{75}}. \qquad (3.203)$$


After the ionization fraction drops below unity, the approximate results given in 
(3.201) and (3.202) are in excellent agreement with the numerical solutions of the 
kinetic equations, while the Saha approximation fails completely (see Figure 3.10). 
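
The integral in (3.201) is easy to evaluate numerically. The sketch below is one possible way to do it, under stated assumptions: it uses a plain midpoint rule after the substitution $u = y^{0.7}$ (which removes the integrable $y^{-0.3}$ behavior at small $y$), takes $z_{in} = 1400$ as an arbitrary starting redshift, and uses $\Omega_m h_{75}^2 = 0.3$, $\eta_{10} = 5$. None of these numerical choices come from the text.

```python
import math


def integrand_u(u):
    """Integrand of (3.201) rewritten in the variable u = y**0.7."""
    y = u ** (1.0 / 0.7)
    first = 0.72 * y ** 0.3
    second = 1.44e8 * y * math.exp(-1.0 / y)
    # dy = (y**0.3 / 0.7) du, so the integrand stays finite as y -> 0.
    return (y ** 0.3 / 0.7) / (first + second)


def integral_3_201(z, z_in=1400.0, steps=20000):
    """Midpoint-rule estimate of the integral from z/14400 to z_in/14400."""
    u_lo = (z / 14400.0) ** 0.7
    u_hi = (z_in / 14400.0) ** 0.7
    h = (u_hi - u_lo) / steps
    return h * sum(integrand_u(u_lo + (i + 0.5) * h) for i in range(steps))


def x_e(z, omega_m_h2=0.3, eta_10=5.0):
    """Ionization fraction from (3.201), neglecting the initial condition."""
    return 6.9e-4 * math.sqrt(omega_m_h2) / eta_10 / integral_3_201(z)


if __name__ == "__main__":
    print(f"integral for z -> 0: {integral_3_201(0.0):.3f}")
    for z in (800, 400, 100, 0):
        print(f"z = {z:4d}   X_e ~ {x_e(z):.2g}")
```

The midpoint rule never evaluates the integrand at $y = 0$, which avoids a division by zero there. The result is insensitive to the assumed $z_{in}$ because the integrand is exponentially small above $z \simeq 900$.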


Problem 3.27 Freeze-out of the electron concentration occurs roughly when the
rate of the reaction $ep \to H\gamma$ becomes comparable to the cosmological expansion
rate. Using this simple criterion, estimate the freeze-out concentration.


At the beginning of recombination, most of the neutral hydrogen atoms are
formed as a result of cascading transitions, and the number of $L_\alpha$ photons is about
the same as the number of hydrogen atoms. What happens to all these $L_\alpha$ photons
afterwards? Do they survive and, if so, can we observe them today as a (redshifted)
narrow line in the spectrum of the CMB? During recombination, the number density
of $L_\alpha$ photons, $n_\alpha$, satisfies the quasi-equilibrium condition for the $L_\alpha$ reservoir:

$$W_{2P\to 1S}\, n_{2P} = \langle\sigma_\alpha\rangle\, n_\alpha n_{1S}. \qquad (3.204)$$

Since $n_{1S} \to n_T$ and $n_{2P} \propto X_e^2$, the number of $L_\alpha$ photons decreases with the
ionization fraction and nearly all of them disappear by the end of recombination.
Their number density is depleted due to two-photon decays of the $2S$ states. Hence,
there will be no sharp line in the primordial radiation spectrum. Nevertheless, as a
result of recombination, the CMB is warped in this part of the Wien region. This
region is significantly obscured, however, by radiation from other astrophysical
sources.

Finally, let us find out when exactly the universe becomes transparent to
radiation. This occurs when the typical time for photon scattering begins to exceed
the cosmological time. The Rayleigh cross-section for scattering on neutral
hydrogen is negligibly small and, therefore, despite the low concentration of
electrons, the opacity is due to Thomson scattering from free electrons. Substituting
$\sigma_T \simeq 6.65 \times 10^{-25}$ cm$^2$, cosmological time $t$ and total number density $n_T$ from
(3.191) and (3.188) respectively, into

$$t \sim \frac{1}{\sigma_T n_T X_e}, \qquad (3.205)$$

we find that photon decoupling occurs when

$$X_e^{dec} \sim 6 \times 10^{3}\, \frac{\sqrt{\Omega_m h_{75}^2}}{\eta_{10}} \left(\frac{T_{\gamma 0}}{T_{dec}}\right)^{3/2}. \qquad (3.206)$$

It follows that $T_{dec} \sim 2500$ K, or equivalently, $z_{dec} \sim 900$, independent of the
cosmological parameters. For $\Omega_m h_{75}^2 \simeq 0.3$ and $\eta_{10} \simeq 5$, the ionization fraction at
this time is about $2 \times 10^{-2}$. It is interesting to note that this time coincides with
the moment when the $e, p$ and $2S$ levels fall out of equilibrium and the approximate
(3.202) becomes inapplicable.
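
The independence of $z_{dec}$ from the cosmological parameters can be seen by equating (3.202) with (3.206): the factor $\sqrt{\Omega_m h_{75}^2}/\eta_{10}$ is common to both and cancels. The sketch below solves the resulting equation by bisection. It assumes the prefactor $6 \times 10^{3}$ in (3.206), approximates $T/T_{\gamma 0} \simeq z$, and uses $T_{\gamma 0} = 2.725$ K to convert to a temperature; the last value and all function names are assumptions of this illustration.

```python
import math

T_GAMMA_0 = 2.725  # present CMB temperature in K (assumed, not quoted here)


def x_e_kinetic(z):
    """(3.202) with the factor sqrt(Omega_m h^2)/eta_10 divided out."""
    return 1.4e9 / z * math.exp(-14400.0 / z)


def x_e_transparency(z):
    """(3.206) with the same factor divided out and T/T_gamma0 ~ z."""
    return 6.0e3 * z ** -1.5


def decoupling_redshift(lo=500.0, hi=1500.0, tol=1e-3):
    """Bisection for the redshift at which the two expressions coincide."""
    def diff(z):
        return x_e_kinetic(z) - x_e_transparency(z)

    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if diff(lo) * diff(mid) <= 0.0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    z_dec = decoupling_redshift()
    print(f"z_dec ~ {z_dec:.0f}, T_dec ~ {T_GAMMA_0 * (1.0 + z_dec):.0f} K")
```

Since the steep exponential in (3.202) controls the solution, even a factor-of-two uncertainty in either prefactor moves $z_{dec}$ by only a few percent.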

Radiation decoupling does not mean that matter and radiation lose all thermal 
contact. In fact, the interaction of a small number of photons with matter keeps the 
temperatures of matter and radiation equal down to redshifts z ~ 100. Only after 
that does the temperature of baryonic matter begin to decrease faster than that of 
radiation. There is no trace of this temperature in baryons seen today because most 
of them are bound to galaxies where they are heated during gravitational collapse. 


4 


The very early universe 


The laws of particle interactions are well established only below the energy cur- 
rently reached by accelerators, which is about a few hundred GeV. The next gen- 
eration of accelerators will allow us to go a couple of orders of magnitude fur- 
ther, but even in the remote future it will be impossible to overcome the existing 
gap of about seventeen orders of magnitude to reach the Planckian scale. There- 
fore, the only “laboratories” for testing particle theories at very high energies are 
the very early universe and astrophysical sources of highly energetic particles. 
The quality of cosmological information is much worse than that gained from 
accelerators. However, given the lack of choice, we can still hope to learn es- 
sential features of high-energy physics based on cosmological and astrophysical 
observations. 

The particle theory describing interactions below the TeV scale is called the 
Standard Model and it comprises the unified electroweak theory and quantum 
chromodynamics, both based on the idea of local gauge symmetry. Attempts to 
incorporate the electroweak and strong interactions in some larger symmetry group 
and thus unify them have not yet met with success. Unfortunately, there are too 
many ways to extend the theory beyond the Standard Model while remaining in 
agreement with available experimental data. Only further experiments can help us 
in selecting the “correct theory of nature.” 

This situation determines our selection of topics for this chapter. First, we con- 
sider the Standard Model, and explore the most interesting consequences of this 
theory for cosmology. In particular, the quark—gluon transition, restoration of elec- 
troweak symmetry and nonconservation of the fermion number will be discussed 
in great detail. 

Two important cosmological issues beyond the Standard Model are the
generation of baryon asymmetry in the universe and the nature of weakly interacting
massive particles, a possible component of cold dark matter. In the following
chapter we will see that any initial baryon asymmetry is washed out during inflation and
its generation is a crucial element of inflationary cosmology. The general conditions
under which this asymmetry occurs are rather simple and model-independent. How- 
ever, the particular realization of these conditions depends on the particle theory 
involved. At present, there exists no preferable scenario for the origin of baryon 
asymmetry. There are many possibilities and the problem is, as always, to select 
the correct one. For these reasons, we will only demonstrate that the important sin- 
gle number, characterizing baryon excess, can be easily “explained.” The situation 
for the origin of cold dark matter is very similar, and we likewise concentrate on 
general ideas here. 

Almost all plausible extensions of the Standard Model have a number of fea- 
tures in common, which are rather insensitive to the details of any particular 
theory. Among these features is a nontrivial vacuum structure, potentially re- 
sponsible for phase transitions in the very early universe. As a result, topologi- 
cal defects, such as domain walls, strings, or monopoles, could also have been 
formed. There is no doubt that such good physics belongs to a primary course on 
cosmology. 

We begin with a brief overview of the elements of the Standard Model, which 
should by no means be considered a substitute for standard textbooks in particle 
physics. It serves as a reminder of the basic ideas we need in cosmological appli- 
cations. To shorten the presentation, we follow an “antihistorical” approach: the 
theory is formulated in its “final” form, and then its consequences for cosmology 
are explored. However, the reader should not forget that the numerous building 
blocks of the Standard Model were discovered as a result of concerted — and 
rarely straightforward — efforts to understand and interpret an enormous amount 
of experimental data. 


4.1 Basics 


Elementary particles are the fundamental indivisible components of matter. They are 
completely characterized by their masses, spins and charges. Different charges are 
responsible for different interactions and the interaction strength is proportional to 
the corresponding charge. There are four known forces: gravitational, electromag- 
netic, weak, and strong. The first two are long-range forces whose strength decays 
following an inverse square law. The weak and strong interactions are short-range
forces. They are effective only over short distances and then decay exponentially
quickly outside this range. Gravity is described by Einstein’s theory of General
Relativity, and the other three interactions by the Standard Model, based on the
idea of local gauge invariance.


4.1.1 Local gauge invariance


Particles are interpreted as elementary excitations of fields. The field describing
free fermions of spin one half (for instance, electrons) obeys the Dirac equation

$$i\gamma^\mu \partial_\mu \psi - m\psi = 0, \qquad (4.1)$$

where $\psi$ is the four-component Dirac spinor and $\gamma^\mu$ are the $4 \times 4$ Dirac matrices.
This equation can be derived from the Lorentz-invariant Lagrangian density

$$\mathcal{L} = i\bar\psi\gamma^\mu\partial_\mu\psi - m\bar\psi\psi, \qquad (4.2)$$

where $\bar\psi \equiv \psi^\dagger\gamma^0$. This Lagrangian is also invariant under global gauge
transformations: that is, it does not change when we multiply $\psi$ by an arbitrary complex
number with unit norm, for example $\exp(-i\theta)$, where $\theta$ is constant in space and
time. What happens, however, if we allow $\theta$ to vary from point to point, taking
$\theta = e\lambda(x^\alpha)$ to be an arbitrary function of space and time? Will the Lagrangian still
remain invariant under such a local gauge transformation? Obviously not. Acting on
$\lambda(x)$, the derivative $\partial_\mu$ generates an extra term,

$$\partial_\mu\psi \to \partial_\mu\left(e^{-ie\lambda}\psi\right) = e^{-ie\lambda}\left(\partial_\mu - ie(\partial_\mu\lambda)\right)\psi, \qquad (4.3)$$


and the invariance of the original Lagrangian (4.2) can only be preserved if we
modify it by introducing an extra field. Under gauge transformations this field
should change in such a way as to cancel the extra term in (4.3). Let us consider
a vector gauge field $A_\mu$ and replace the derivative $\partial_\mu$ in (4.3) with the “covariant
derivative”

$$D_\mu \equiv \partial_\mu + ieA_\mu. \qquad (4.4)$$
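If we assume that under gauge transformations $A_\mu \to \tilde A_\mu$, then

$$D_\mu\psi \to \tilde D_\mu\left(e^{-ie\lambda}\psi\right) = e^{-ie\lambda}\left(\partial_\mu + ie\tilde A_\mu - ie(\partial_\mu\lambda)\right)\psi.$$

Therefore we postulate the transformation law

$$A_\mu \to \tilde A_\mu = A_\mu + \partial_\mu\lambda, \qquad (4.5)$$

and find

$$\tilde D_\mu\left(e^{-ie\lambda}\psi\right) = e^{-ie\lambda}D_\mu\psi.$$

Hence

$$\bar\psi\gamma^\mu D_\mu\psi \to \left(\bar\psi e^{ie\lambda}\right)\gamma^\mu\tilde D_\mu\left(e^{-ie\lambda}\psi\right) = \bar\psi\gamma^\mu D_\mu\psi,$$

and Lagrangian (4.2), when we substitute $D_\mu$ for $\partial_\mu$, is invariant under local gauge
transformations.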
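
The cancellation of the extra term can also be checked numerically. The sketch below is a toy version of the argument, under stated assumptions: a single complex field in one dimension (the spinor structure plays no role in the cancellation), arbitrarily chosen smooth test functions for the field, the gauge field and the gauge function, an illustrative value of $e$, and finite-difference derivatives. All names are hypothetical.

```python
import cmath
import math

E = 0.3  # illustrative value of the charge e (an assumption)


def psi(x):
    """Arbitrary smooth test field with one complex component."""
    return (1.0 + 0.5 * math.sin(x)) * cmath.exp(2j * x)


def gauge_field(x):
    """Arbitrary smooth test gauge field A(x)."""
    return math.cos(x)


def lam(x):
    """Arbitrary gauge function lambda(x)."""
    return math.sin(3.0 * x)


def ddx(f, x, h=1e-5):
    """Central finite-difference derivative."""
    return (f(x + h) - f(x - h)) / (2.0 * h)


def covariant(field, a, x):
    """One-dimensional analogue of (4.4): D psi = (d/dx + i e A) psi."""
    return ddx(field, x) + 1j * E * a(x) * field(x)


def psi_new(x):
    """Locally transformed field exp(-i e lambda) psi."""
    return cmath.exp(-1j * E * lam(x)) * psi(x)


def gauge_field_new(x):
    """Transformed gauge field according to (4.5): A + d(lambda)/dx."""
    return gauge_field(x) + ddx(lam, x)


def defect(x, transform_a=True):
    """|D~(exp(-ie lambda) psi) - exp(-ie lambda) D psi| at the point x."""
    a_used = gauge_field_new if transform_a else gauge_field
    lhs = covariant(psi_new, a_used, x)
    rhs = cmath.exp(-1j * E * lam(x)) * covariant(psi, gauge_field, x)
    return abs(lhs - rhs)


if __name__ == "__main__":
    print("with A transformed by (4.5):", defect(0.7))
    print("with A left unchanged:     ", defect(0.7, transform_a=False))
```

With the gauge field transformed according to (4.5) the mismatch is at the level of the finite-difference error, while leaving $A$ unchanged produces a mismatch of order $e\,|\partial\lambda|\,|\psi|$.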

The gauge field $A_\mu$ can be a dynamical field. To find the Lagrangian describing
its dynamics, we have to build a gauge-invariant Lorentz scalar out of the field
strength $A_\mu$ and its derivatives. As follows from (4.5),

$$F_{\mu\nu} \equiv D_\mu A_\nu - D_\nu A_\mu = \partial_\mu A_\nu - \partial_\nu A_\mu \qquad (4.6)$$

does not change under gauge transformations and therefore the Lorentz scalar
$F_{\mu\nu}F^{\mu\nu}$ is the simplest Lagrangian we can construct. The scalar $A_\mu A^\mu$, which
would give mass to the field, is not allowed because it would spoil gauge invariance.
In the resulting full Lagrangian,

$$\mathcal{L} = i\bar\psi\gamma^\mu\partial_\mu\psi - m\bar\psi\psi - \frac{1}{4}F_{\mu\nu}F^{\mu\nu} - e\left(\bar\psi\gamma^\mu\psi\right)A_\mu, \qquad (4.7)$$

the reader will immediately recognize electrodynamics with the coupling
constant proportional to the electric charge $e$. Because the fine structure constant
$\alpha \equiv e^2/4\pi \simeq 1/137$ is small, one can consider the interaction term as a small
correction and hence develop perturbation theory.

It is convenient to represent this perturbation theory by Feynman diagrams,
where the interaction term $e(\bar\psi\gamma^\mu\psi)A_\mu$ corresponds to a vertex where electron
lines $\psi$, $\bar\psi$ meet photon line $A$. The incoming solid line corresponds to $\psi$ and the
outgoing to $\bar\psi$. Assuming that time runs “horizontally to the right,” Figure 4.1(a)
is read as follows: the electron enters the vertex, emits (or absorbs) the photon, 
and goes on. A rule, which is justified in quantum field theory, is the following: 
an electron “running backward in time” on the same diagram, but reoriented as in 
Figure 4.1(b), is interpreted as its antiparticle, a positron, running forward in time. 





Fig. 4.1. Diagrams (a), (b) and (c); time runs horizontally to the right.

Therefore, this diagram describes electron–positron annihilation with the emission
of a photon. Because the photon is its own antiparticle, we do not need an arrow on its 
line. More complicated processes can be described by simply combining primitive 
vertices. For instance, Figure 4.1(c) is responsible for the Coulomb repulsion of 
two electrons. 

The replacement of all particles by antiparticles (charge conjugation C) corre- 
sponds to the reversal of all arrows on the diagrams. Lagrangian (4.7) is invariant 
with respect to charge conjugation. 


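Problem 4.1 Consider a complex scalar field $\varphi$ with Lagrangian

$$\mathcal{L} = \frac{1}{2}\left(\partial^\mu\varphi^{*}\,\partial_\mu\varphi - m^2\varphi^{*}\varphi\right). \qquad (4.8)$$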


How should this Lagrangian be generalized to become locally gauge-invariant? 
Write down the interaction terms and draw the corresponding vertices. 


4.1.2 Non-Abelian gauge theories 


The gauge transformations we have considered so far can be thought of as a
multiplication of $\psi$ by $1 \times 1$ unitary matrices $\mathbf{U} = \exp(-i\theta)$, satisfying $\mathbf{U}^\dagger\mathbf{U} = 1$.
The group of all such matrices is called $U(1)$. The local $U(1)$ gauge invariance of
electrodynamics was realized a long time ago. However, the importance of such 
symmetry was not fully appreciated until 1954, when Yang and Mills extended it 
to SU(2) local gauge transformations. This symmetry was later used to construct 
the electroweak theory. 

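The transformations generated by $N \times N$ unitary matrices $\mathbf{U}$ are called $U(N)$
gauge transformations. Generalizing from $U(1)$ gauge transformations is very
straightforward. Let us consider $N$ free Dirac fields with equal masses. Then the
Lagrangian is

$$\mathcal{L} = \sum_{a=1}^{N}\left(i\bar\psi_a\gamma^\mu\partial_\mu\psi^a - m\bar\psi_a\psi^a\right) = i\bar{\boldsymbol\psi}\gamma^\mu\partial_\mu\boldsymbol\psi - m\bar{\boldsymbol\psi}\boldsymbol\psi, \qquad (4.9)$$

where $a = 1, \ldots, N$ and in the second equality we have introduced the matrix
notation

$$\boldsymbol\psi \equiv \begin{pmatrix}\psi^1\\ \vdots\\ \psi^N\end{pmatrix}, \qquad \bar{\boldsymbol\psi} \equiv \left(\bar\psi_1, \ldots, \bar\psi_N\right).$$

One should not forget that every element of these matrices is in its turn a
four-component Dirac spinor. The fields $\psi^a$ have the same spins and masses, and
therefore differ only by charges (for instance, in quantum chromodynamics these
charges are called “colors”). Lagrangian (4.9) is obviously invariant with respect to
the global gauge transformation generated by the unitary, spacetime-independent
matrix $\mathbf{U}$:

$$\boldsymbol\psi \to \mathbf{U}\boldsymbol\psi,$$

because $\bar{\boldsymbol\psi} \to \bar{\boldsymbol\psi}\mathbf{U}^\dagger$ and $\mathbf{U}^\dagger\mathbf{U} = 1$. This is no longer true if we assume that matrix
$\mathbf{U}$ is a function of $x^\alpha$. As in (4.3), the derivative $\partial_\mu$ induces an extra term:

$$\partial_\mu\boldsymbol\psi \to \partial_\mu(\mathbf{U}\boldsymbol\psi) = \mathbf{U}\left(\partial_\mu + \mathbf{U}^{-1}(\partial_\mu\mathbf{U})\right)\boldsymbol\psi,$$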


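which needs to be compensated for if we want to preserve gauge invariance. For
this purpose, let us introduce the gauge fields $\mathbf{A}_\mu$, which are Hermitian $N \times N$
matrices, and replace $\partial_\mu$ by the “covariant derivative”

$$\mathbf{D}_\mu \equiv \partial_\mu + ig\mathbf{A}_\mu, \qquad (4.10)$$

where $g$ is the gauge coupling constant. If we assume that under gauge
transformations $\mathbf{A}_\mu \to \tilde{\mathbf{A}}_\mu$, we obtain

$$\mathbf{D}_\mu\boldsymbol\psi \to \tilde{\mathbf{D}}_\mu(\mathbf{U}\boldsymbol\psi) = \mathbf{U}\left(\partial_\mu + ig\mathbf{U}^{-1}\tilde{\mathbf{A}}_\mu\mathbf{U} + \mathbf{U}^{-1}(\partial_\mu\mathbf{U})\right)\boldsymbol\psi.$$

(Note that one must be careful with the order of multiplication because the matrices
do not generally commute.) Therefore, we postulate the transformation law

$$\mathbf{A}_\mu \to \tilde{\mathbf{A}}_\mu = \mathbf{U}\mathbf{A}_\mu\mathbf{U}^{-1} + \frac{i}{g}(\partial_\mu\mathbf{U})\mathbf{U}^{-1}. \qquad (4.11)$$

Then

$$\tilde{\mathbf{D}}_\mu(\mathbf{U}\boldsymbol\psi) = \mathbf{U}\mathbf{D}_\mu\boldsymbol\psi,$$

and the Lagrangian

$$\mathcal{L} = i\bar{\boldsymbol\psi}\gamma^\mu\mathbf{D}_\mu\boldsymbol\psi - m\bar{\boldsymbol\psi}\boldsymbol\psi \qquad (4.12)$$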


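is invariant under $U(N)$ local gauge transformations. To derive the Lagrangian for
the gauge fields, we note that

$$\mathbf{F}_{\mu\nu} \equiv \mathbf{D}_\mu\mathbf{A}_\nu - \mathbf{D}_\nu\mathbf{A}_\mu = \partial_\mu\mathbf{A}_\nu - \partial_\nu\mathbf{A}_\mu + ig\left(\mathbf{A}_\mu\mathbf{A}_\nu - \mathbf{A}_\nu\mathbf{A}_\mu\right) \qquad (4.13)$$

transforms as $\mathbf{F}_{\mu\nu} \to \tilde{\mathbf{F}}_{\mu\nu} = \mathbf{U}\mathbf{F}_{\mu\nu}\mathbf{U}^{-1}$, and hence the simplest gauge-invariant
Lorentz scalar is $\mathrm{tr}\left(\mathbf{F}_{\mu\nu}\mathbf{F}^{\mu\nu}\right)$. The full Lagrangian is then

$$\mathcal{L} = i\bar{\boldsymbol\psi}\gamma^\mu\partial_\mu\boldsymbol\psi - m\bar{\boldsymbol\psi}\boldsymbol\psi - g\bar{\boldsymbol\psi}\gamma^\mu\mathbf{A}_\mu\boldsymbol\psi - \frac{1}{2}\mathrm{tr}\left(\mathbf{F}_{\mu\nu}\mathbf{F}^{\mu\nu}\right), \qquad (4.14)$$

where we have used the standard normalization for the last term in cases where
$N \geq 2$.


Problem 4.2 Verify the transformation law for $\mathbf{F}_{\mu\nu}$. (Hint To simplify the
calculation, justify and use the following commutation rule: $\tilde{\mathbf{D}}_\mu\mathbf{U} = \mathbf{U}\mathbf{D}_\mu$.)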


Thus, starting from a simple idea, we have achieved a significant result. Namely, 
the interactions between fermions and gauge fields, as well as the simplest possible 
Lagrangian for the gauge fields, were completely determined by the requirement 
of gauge invariance. We would like to stress once more that the gauge fields are 
massless because a mass term would spoil the gauge invariance. 

There is an important difference between U(1) and U(N) groups. All elements 
of the U(1) group (complex numbers) commute with each other (Abelian group), 
while in the case of the U(N) group the elements do not generally commute (non- 
Abelian group). This has an important consequence. The U(1) gauge field has no 
self-coupling and interacts only with fermions (the last term in (4.13) vanishes when 
N = 1), or, in other words, this field does not carry the group charge (photons are 
electrically neutral). The non-Abelian U(N) fields do carry group charges and the 
last term in (4.13) induces their self-interaction. 


Problem 4.3 Consider N complex scalar fields instead of fermions and find the 
interaction terms in this case. Draw corresponding diagrams including those de- 
scribing the self-interaction of the gauge fields. 


To find the minimum number of compensating fields needed to ensure gauge 
invariance, we have to count the number of generators of the U(N) group or, in 
other words, the number of independent elements of an N x N unitary matrix. Any 
unitary matrix can be written as 


$$\mathbf{U} = \exp(i\mathbf{H}), \qquad (4.15)$$

where $\mathbf{H}$ is an Hermitian matrix ($\mathbf{H} = \mathbf{H}^\dagger$).


Problem 4.4 Verify that the number of independent real numbers characterizing
an $N \times N$ Hermitian matrix is equal to $N^2$.


In turn, an Hermitian $\mathbf{H}$ can always be decomposed into a linear superposition
of $N^2$ independent basis matrices, one of which is the unit matrix

$$\mathbf{H} = \theta\mathbf{1} + \sum_{C=1}^{N^2-1}\theta^C\mathbf{T}_C \equiv \theta\mathbf{1} + \theta^C\mathbf{T}_C, \qquad (4.16)$$

where $\mathbf{T}_C$ are traceless matrices and $\theta^C$ are real numbers; hence

$$\mathbf{U} = e^{i\theta}\exp\left(i\theta^C\mathbf{T}_C\right). \qquad (4.17)$$

The first multiplier corresponds to the $U(1)$ Abelian subgroup of $U(N)$ and the
second factor belongs to the $SU(N)$ subgroup consisting of all unitary matrices
with $\det\mathbf{U} = 1$. Therefore we can write $U(N) = U(1) \times SU(N)$, and consider the
local $SU(N)$ gauge groups separately. The $SU(N)$ group has $N^2 - 1$ independent
generators and hence we need at least that number of independent compensating
fields $A^C_\mu$. The Hermitian matrix $\mathbf{A}_\mu$ can then be written as

$$\mathbf{A}_\mu = A^C_\mu\mathbf{T}_C. \qquad (4.18)$$

For $SU(2)$ and $SU(3)$ groups it is convenient to use as the basis matrices $\sigma_C/2$
and $\lambda_C/2$ respectively, where $\sigma_C$ are three familiar Pauli matrices and $\lambda_C$ are eight
Gell-Mann matrices, the explicit form of which will not be needed here.
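
For the simplest non-Abelian case, $SU(2)$, the properties of the basis matrices used above are easy to verify explicitly. The following sketch, with helper names made up for this illustration, checks that $\sigma_C/2$ are Hermitian and traceless, that they do not commute, and that the count $N^2 - 1$ gives three and eight generators for $SU(2)$ and $SU(3)$.

```python
def matmul(a, b):
    """Product of two square matrices stored as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def dagger(a):
    """Hermitian conjugate of a square matrix."""
    n = len(a)
    return [[a[j][i].conjugate() for j in range(n)] for i in range(n)]


def scaled(c, a):
    """Matrix multiplied by the number c."""
    return [[c * x for x in row] for row in a]


def commutator(a, b):
    """[a, b] = ab - ba."""
    ab, ba = matmul(a, b), matmul(b, a)
    return [[x - y for x, y in zip(r1, r2)] for r1, r2 in zip(ab, ba)]


def close(a, b, tol=1e-12):
    """True if two matrices agree entry by entry within tol."""
    return all(abs(x - y) < tol
               for r1, r2 in zip(a, b) for x, y in zip(r1, r2))


def trace(a):
    return sum(a[i][i] for i in range(len(a)))


# The three Pauli matrices.
SIGMA = [
    [[0j, 1 + 0j], [1 + 0j, 0j]],
    [[0j, -1j], [1j, 0j]],
    [[1 + 0j, 0j], [0j, -1 + 0j]],
]
# SU(2) basis matrices T_C = sigma_C / 2.
T = [scaled(0.5, s) for s in SIGMA]


def su_generators(n):
    """Number of independent generators of SU(N): N^2 - 1."""
    return n * n - 1


if __name__ == "__main__":
    for c, t in enumerate(T, start=1):
        print(c, "Hermitian:", close(t, dagger(t)), "trace:", trace(t))
    print("[T1, T2] == i T3:", close(commutator(T[0], T[1]), scaled(1j, T[2])))
    print("generators of SU(2), SU(3):", su_generators(2), su_generators(3))
```

The nonvanishing commutator is precisely what makes the last term in (4.13) survive for $N \geq 2$.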


4.2 Quantum chromodynamics and quark-gluon plasma 


The strong force is responsible for binding neutrons and protons within nuclei. 
The particles participating in strong interactions are called hadrons. They can be 
either fermions or bosons. The fermions have half-integer spin and they are called 
baryons, while the bosons have integer spin and are called mesons. The hadron 
family is extremely large. To date, several hundred hadrons have been discovered. 
It would be a nightmare if all these particles were elementary. Fortunately, they 
are composite and built out of fermions of spin 1/2 called quarks. This is similar 
to the way all chemical elements are made of protons and neutrons. In contrast to 
the chemical elements, each of which has its own name, only the lightest and most 
important hadrons have names reflecting their “individuality.” To classify hadrons 
(or, in other words, put them in their own “Periodic Table”) we need five different
kinds (flavors) of quarks, which are accompanied by appropriate antiquarks. The 
sixth quark, needed for cancellation of anomalies in the Standard Model, was also 
discovered experimentally. The quarks have different masses and electric charges. 
Three of them, namely u (up), c (charm), and t (top) quarks, have a positive electric 
charge, which is +2/3 of the elementary charge. The other three quarks, d (down), 
s (strange), and b (bottom), have a negative electric charge equal to —1/3. 

Strong interactions of quarks are described by an SU(3) gauge theory called 
quantum chromodynamics. According to this theory, every quark of a given flavor 
comes in three different colors: “red” (r), “blue” (b), and “green” (g). The colors are 
simply names for the charges of the SU(3) gauge group, which acts on triplets of 
spinor fields of the same flavor but different colors. The gauge-invariant quantum
chromodynamics Lagrangian is

$$\mathcal{L} = \sum_f\left(i\bar{\boldsymbol\psi}_f\gamma^\mu\partial_\mu\boldsymbol\psi_f - m_f\bar{\boldsymbol\psi}_f\boldsymbol\psi_f - \frac{1}{2}g_s\left(\bar{\boldsymbol\psi}_f\gamma^\mu\boldsymbol\lambda_C\boldsymbol\psi_f\right)A^C_\mu\right) - \frac{1}{2}\mathrm{tr}\left(\mathbf{F}_{\mu\nu}\mathbf{F}^{\mu\nu}\right), \qquad (4.19)$$

where $g_s$ is the strong coupling constant, $\boldsymbol\lambda_C$ (where $C = 1, \ldots, 8$) are the eight
Gell-Mann $3 \times 3$ matrices and $f$ runs over the quark flavors $u, d, s, c, t, b$. There are
eight gauge fields $A^C_\mu$, called gluons, which are responsible for strong interactions.
The symbol $\boldsymbol\psi_f$ denotes the column of three quark spinor fields:

$$\boldsymbol\psi_f = \begin{pmatrix} r_f\\ b_f\\ g_f \end{pmatrix}, \qquad (4.20)$$

where $r_f$ is the Dirac spinor describing the red quark with flavor $f$, etc. The bare
quark masses $m_f$ are determined from experiments. They are very different and
not so well known. The lightest is the $u$ quark with mass of 1.5–4.5 MeV. The
$d$ quark is a bit heavier: $m_d = 5$–$8.5$ MeV. The strange quark has a mass of
80–155 MeV, and the remaining three quarks are much heavier: $m_c = 1.3 \pm 0.3$ GeV,
$m_b = 4.3 \pm 0.2$ GeV, and $m_t \sim 170$ GeV.

The antiparticles of the quarks are called antiquarks and they can have “antired”
($\bar r$), “antiblue” ($\bar b$), and “antigreen” ($\bar g$) colors. As distinct from photons, gluons
are also charged. They carry one unit of color and one of anticolor. For instance,
using the explicit form of Gell-Mann matrix $\boldsymbol\lambda_1$, we find that the first interaction
term in the Lagrangian (4.19) is

$$g_s\left(\bar b r + \bar r b\right)A^1,$$

where we have omitted flavor and spacetime indices together with the Dirac
matrices. The appropriate quark–gluon vertices describing this interaction are shown in
Figure 4.2. When a quark changes its color, the color difference is carried off by a
gluon, which in this case is either $r\bar b$ or $b\bar r$ colored. The state $(r\bar b + b\bar r)$ is the first
state of the “color octet” of gluons. Using the explicit form of the Gell-Mann
matrices, the reader can easily find the remaining seven states of the octet. In principle,
however, from three colors and three anticolors, we can compose nine independent
color–anticolor combinations: $r\bar r$, $r\bar b$, $r\bar g$, $b\bar r$, $b\bar b$, $b\bar g$, $g\bar r$, $g\bar b$, $g\bar g$. Therefore, one
is led to ask which particular combination of colors does not occur in Lagrangian
(4.19) and hence does not participate in strong interactions. The answer is the “color
singlet” $(r\bar r + b\bar b + g\bar g)$, which is invariant under $SU(3)$ gauge transformations.
This color combination would only occur if the unit matrix were among the $\boldsymbol\lambda$
matrices. But the unit matrix was excluded when we decided to restrict ourselves
to the $SU(3)$ group instead of the $U(3)$ group. The $U(3)$ group would have an extra
$U(1)$ gauge boson decoupled from the other gluons. This boson would induce
long-range interactions between all hadrons regardless of electrical charge, in obvious
contradiction with experiments.

In contrast to photons, gluons interact with each other. The Lagrangian for non- 
Abelian gauge fields contains third and fourth powers in the field strength A and 
the corresponding interaction vertices have three and four legs respectively. 

Conservation laws are easily determined from the elementary vertices. First of 
all, we see that quark flavor does not change in strong interactions and this leads to 
numerous flavor conservation laws. The total number of quarks minus the number of 
antiquarks also remains unchanged, and hence the total baryon number is conserved 
(by convention a quark has baryon number $1/3$ and an antiquark $-1/3$). In addition,
there is a color conservation law which is analogous to electric charge conservation 
in electrodynamics. 

At first glance, the number of quarks (6 flavors x 3 colors = 18 quarks) seems 
too large to give an elegant explanation of the “Periodic Table of the hadrons”: 
the Periodic Table of chemical elements is built out of only two elementary con- 
stituents — protons and neutrons. However, one should not forget that unlike the 
chemical elements, which can be composed from an arbitrary number of protons 
and neutrons, the few hundred hadrons consist only of quark—antiquark pairs or of 
three quarks. To be precise, all mesons are composed of quark—antiquark pairs and 
all baryons consist of three quarks. For instance, the lightest baryons, the proton 
and the neutron, are composed of uud and udd quarks respectively. The lightest 
meson, $\pi^+$, is made of a $u$ quark and a $d$ antiquark.
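
The bookkeeping of electric charge and baryon number for these bound states can be spelled out in a few lines. In the sketch below the quark charges and the baryon-number convention are those quoted in the text, while the function name and the use of a trailing '~' to mark an antiquark are notational choices made only for this illustration.

```python
from fractions import Fraction

# Electric charges of the six quark flavors, in units of the elementary charge.
CHARGE = {
    "u": Fraction(2, 3), "c": Fraction(2, 3), "t": Fraction(2, 3),
    "d": Fraction(-1, 3), "s": Fraction(-1, 3), "b": Fraction(-1, 3),
}


def quantum_numbers(constituents):
    """Return (electric charge, baryon number) of a bound state.

    Each constituent is a flavor letter; a trailing '~' marks an antiquark
    (a notation invented for this sketch).
    """
    charge = Fraction(0)
    baryon = Fraction(0)
    for q in constituents:
        sign = -1 if q.endswith("~") else 1
        charge += sign * CHARGE[q[0]]
        baryon += sign * Fraction(1, 3)
    return charge, baryon


if __name__ == "__main__":
    for name, content in [("proton", ["u", "u", "d"]),
                          ("neutron", ["u", "d", "d"]),
                          ("pi+", ["u", "d~"])]:
        q, b = quantum_numbers(content)
        print(f"{name:8s} charge = {q}, baryon number = {b}")
```

Exact fractions are used so that the three thirds add up to an integer without rounding.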

There is a deep reason why two or four quark bound systems do not exist as 
free “particles.” Every naturally occurring particle should be a color singlet. This 
statement is known as the confinement hypothesis, according to which colored par- 
ticles, irrespective of whether they are elementary or composite, cannot be observed 
below the confinement scale. In particular, quarks are always bound within mesons 
and baryons. 

As we have seen, the colorless gluon state $(r\bar r + b\bar b + g\bar g)$ does not enter the
fundamental Lagrangian (4.19). Therefore, it is natural to assume that the
appropriate colorless composite particles are “neutral” with respect to strong interaction
and can exist at any energy scale. The above color singlet can be built only as a
quark–antiquark pair and corresponds to mesons. Another possible color singlet is
a three-quark combination: $(rbg - rgb + grb - gbr + bgr - brg)$, and it
corresponds to baryons. All other colorless states can be interpreted as describing a few
mesons or baryons.


Problem 4.5 Verify that color–anticolor and three-quark combinations are really
colorless, that is, they do not change under gauge transformation. (Hint The
different anticolors can be thought of as three-element rows, $\bar r = (1, 0, 0)$, $\bar b = (0, 1, 0)$,
$\bar g = (0, 0, 1)$, and different colors as corresponding columns.)
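
A numerical illustration of the statement in Problem 4.5 (not a substitute for the proof) is sketched below. It assumes a particular way of producing a sample $SU(3)$ matrix, namely a product of three $SU(2)$ rotations embedded in the color space with arbitrarily chosen angles and phases, and then checks that the color–anticolor sum, represented by the unit matrix, and the fully antisymmetric three-quark combination are left unchanged. All function names are hypothetical.

```python
import cmath
import math
from itertools import product


def embedded_su2(i, j, angle, phase):
    """3x3 unitary matrix with unit determinant rotating colors i and j."""
    u = [[1 + 0j if r == c else 0j for c in range(3)] for r in range(3)]
    c, s = math.cos(angle), math.sin(angle)
    u[i][i] = complex(c)
    u[j][j] = complex(c)
    u[i][j] = s * cmath.exp(1j * phase)
    u[j][i] = -s * cmath.exp(-1j * phase)
    return u


def matmul(a, b):
    return [[sum(a[r][k] * b[k][c] for k in range(3)) for c in range(3)]
            for r in range(3)]


def sample_su3():
    """An arbitrary SU(3) matrix (angles and phases chosen at random by hand)."""
    u = embedded_su2(0, 1, 0.7, 0.3)
    u = matmul(u, embedded_su2(1, 2, 1.1, -0.8))
    return matmul(u, embedded_su2(0, 2, 0.4, 1.9))


def epsilon(i, j, k):
    """Totally antisymmetric symbol for indices 0, 1, 2."""
    return (i - j) * (j - k) * (k - i) // 2


def meson_singlet_defect(u):
    """Largest entry of U 1 U^dagger - 1: change of the color-anticolor sum."""
    worst = 0.0
    for a, b in product(range(3), repeat=2):
        entry = sum(u[a][i] * u[b][i].conjugate() for i in range(3))
        worst = max(worst, abs(entry - (1 if a == b else 0)))
    return worst


def baryon_singlet_defect(u):
    """Largest entry of eps_abc U_ai U_bj U_ck - eps_ijk."""
    worst = 0.0
    for i, j, k in product(range(3), repeat=3):
        total = sum(epsilon(a, b, c) * u[a][i] * u[b][j] * u[c][k]
                    for a, b, c in product(range(3), repeat=3))
        worst = max(worst, abs(total - epsilon(i, j, k)))
    return worst


if __name__ == "__main__":
    u = sample_su3()
    print("change of color-anticolor singlet:", meson_singlet_defect(u))
    print("change of three-quark singlet:    ", baryon_singlet_defect(u))
```

The first check uses only unitarity, whereas the second relies on $\det\mathbf{U} = 1$: under a general $U(3)$ matrix the three-quark combination would pick up the phase $\det\mathbf{U}$.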


Confinement should, in principle, be derivable from the fundamental Lagrangian 
(4.19) but until now this has not been achieved. Nevertheless, there are strong 
experimental and theoretical indications that this hypothesis is valid. In particular, 
the increase of the strong interaction strength at low energy strongly supports the 
idea of confinement. The energy dependence of the strong coupling constant has 
another important feature: the interaction strength vanishes in the limit of very high 
energies (or, correspondingly, at very small distances). This is called asymptotic 
freedom. As a consequence, this allows us to use perturbation theory to calculate 
strong interaction processes with highly energetic hadrons. We explain below why 
the coupling constant should be scale-dependent and then calculate how it “runs.” To 
introduce the concept of a running coupling, we begin with familiar electromagnetic
interactions, and then derive the results for a general renormalizable field theory 
and apply them to quantum chromodynamics. 


4.2.1 Running coupling constant and asymptotic freedom 


Let us consider two electrically charged particles. According to quantum
electrodynamics, their interaction via photon exchange can be represented by a set of
diagrams, some of which are shown in Figure 4.3. The contribution of a particular
diagram to the total interaction strength is determined by the number of primitive
vertices in the diagram, since each vertex brings a factor of $e$. Because $e < 1$, the largest
contribution to the interaction ($\propto e^2$) comes from the simplest (tree-level) diagram
with only two vertices. The next order (one-loop) diagrams have four vertices and
their contribution is proportional to $e^4$. Hence, the diagrams in Figure 4.3 are simply
a graphical representation of the perturbative expansion in powers of the fine
structure constant

$$\alpha \equiv \frac{e^2}{4\pi} \simeq \frac{1}{137}. \tag{4.20}$$

The interaction strength depends not only on the charge but also on the distance
between the particles, characterized by the 4-momentum transfer

$$q^\mu = p_2^\mu - p_1^\mu. \tag{4.21}$$

Note that for virtual photons $q^2 \equiv |q^\mu q_\mu| \neq 0$, that is, they do not lie on their mass
shell.

Fig. 4.3.


The diagrams containing closed loops are generically divergent. Fortunately, in
so-called renormalizable theories, the divergences can be “isolated and combined”
with bare coupling constants, bare masses, etc. What we measure in experiment is
not the value of the bare parameter, but only the finite outcome of “its combination
with infinities.” For instance, given a distance characterized by the momentum
transfer $q^2 = \mu^2$ (called the normalization point), we can measure the interaction
force and thus determine the renormalized coupling constant $\alpha(\mu^2)$, which becomes
the actual parameter of the perturbative expansion. After removing and absorbing
the infinities, there remain finite $q^2$-dependent loop contributions to the interaction
force (the vacuum polarization effect). They too can be absorbed by redefining the
coupling constant, which becomes $q^2$-dependent, or in other words, begins to run. In
the limit of vanishing masses (or for $q^2 \gg m^2$), the expansion of the dimensionless
running coupling “constant” $\alpha(q^2)$ in powers of the renormalized coupling constant
$\alpha(\mu^2)$ can be expressed on dimensional grounds as

$$\alpha(q^2) = \alpha(\mu^2) + \alpha^2(\mu^2) f_1\!\left(\frac{q^2}{\mu^2}\right) + \cdots = \sum_{n=0}^{\infty} \alpha^{n+1}(\mu^2) f_n\!\left(\frac{q^2}{\mu^2}\right), \tag{4.22}$$

where $f_0 = 1$ and the other functions $f_n$ are determined by appropriate $n$-loop
diagrams. Since $\alpha(q^2) = \alpha(\mu^2)$ at $q^2 = \mu^2$, we have

$$f_n(1) = 0$$

for $n \geq 1$.

If we consider a process with $q^2$-momentum transfer, we can use the running
constant $\alpha(q^2)$ instead of $\alpha(\mu^2)$ as a small expansion parameter in the remaining
finite diagrams. This corresponds to the resummation of finite contributions from
divergent diagrams. However, to take advantage of this resummation, we have to
figure out the structure of perturbative expansion (4.22) and find a way to resum
this series, at least partially. This can be done using simple physical arguments.
Let us note that the value of the coupling constant $\alpha(q^2)$ should not depend on the
normalization point $\mu^2$, which is arbitrary. Therefore, the derivative of the right
hand side of (4.22) with respect to $\mu^2$ should be equal to zero:

$$\frac{d}{d\mu^2}\left(\sum_{n=0}^{\infty} \alpha^{n+1}(\mu^2) f_n\!\left(\frac{q^2}{\mu^2}\right)\right) = 0. \tag{4.23}$$


Differentiating and rearranging the terms, we obtain the following differential
equation for $\alpha(\mu^2)$:

$$\frac{d\alpha(\mu^2)}{d\ln\mu^2} = \frac{\sum_{l=0}^{\infty} x f_l'(x)\,\alpha^{l+1}}{\sum_{l=0}^{\infty} (l+1) f_l(x)\,\alpha^{l}} = \sum_{l=0}^{\infty} f_{l+1}'(1)\,\alpha^{l+2}(\mu^2), \tag{4.24}$$

where a prime denotes the derivative with respect to $x \equiv q^2/\mu^2$. The ratio of sums
in (4.24) should not depend on $x$ because the left hand side of the equation is
$x$-independent. Therefore, to obtain the second equality in (4.24) we set $x = 1$. The
requirement that the ratio does not depend on $x$ imposes rather strong restrictions on
the admissible functions $f_n(x)$. From the second equality, we derive the following
recurrence relations:

$$\frac{d f_{n+1}(x)}{d\ln x} = \sum_{k=0}^{n} (k+1)\, f_{n-k+1}'(1)\, f_k(x). \tag{4.25}$$


Problem 4.6 Verify that the general solution of these recurrence relations is given
by

$$f_n(x) = \sum_{l=0}^{n} c_l (\ln x)^l, \tag{4.26}$$

where the numerical coefficient in front of the leading logarithm is equal to
$c_n = (f_1'(1))^n$.


The running constant $\alpha(q^2)$ depends on $q^2$ in the same way that $\alpha(\mu^2)$ depends
on $\mu^2$. Hence $\alpha(q^2)$ satisfies the equation

$$\frac{d\alpha(q^2)}{d\ln q^2} = \alpha^2(q^2)\left(\sum_{l=0}^{\infty} f_{l+1}'(1)\,\alpha^{l}(q^2)\right), \tag{4.27}$$

which follows from (4.24) by the substitution $\mu^2 \to q^2$. Equations (4.24) and
(4.27) are the well known Gell-Mann–Low renormalization group equations and
the expression on the right hand side of these equations,

$$\beta(\alpha) = f_1'(1)\,\alpha^2 + f_2'(1)\,\alpha^3 + \cdots, \tag{4.28}$$

is called the $\beta$ function.

The results obtained are generic and valid in any renormalizable quantum field
theory. The only input we need from a concrete theory is the numerical values of
the coefficients $f_l'(1)$. For instance, to determine $f_1'(1)$, one has to calculate the
appropriate one-loop diagrams. Other coefficients require the calculation of
higher-order diagrams.

Let us assume that $\alpha(q^2) \ll 1$ for the $q^2$ of interest (this assumption should be
checked a posteriori). In this case we may retain in the $\beta$ function only the leading
one-loop term $f_1'(1)\,\alpha^2$, and neglect all higher order contributions. Equation (4.27)
is then easily integrated, with the result

$$\alpha(q^2) = \frac{\alpha(\mu^2)}{1 - f_1'(1)\,\alpha(\mu^2)\ln(q^2/\mu^2)}, \tag{4.29}$$

where $\alpha(\mu^2)$ reappears as an integration constant. The expression obtained
corresponds to the partial resummation of series (4.22). As is clear from (4.26), this
resummation takes into account only the leading $(\ln x)^n$ contributions of all $n$-loop
diagrams. Knowledge of the $\beta$ function to two loops, combined with the
Gell-Mann–Low equations, would allow us to resum next-to-leading logarithms.
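
The statement that the closed form (4.29) resums the leading logarithms of (4.22) can be
illustrated numerically. The sketch below is an illustration rather than part of the
original text: `b1` stands for $f_1'(1)$, `log_ratio` for $\ln(q^2/\mu^2)$, and the sample
numbers (an electron-like coefficient $1/3\pi$, $\alpha(\mu^2) = 1/137$, $\ln(q^2/\mu^2) = 24$)
are chosen only for demonstration.

```python
import math


def alpha_one_loop(alpha_mu, b1, log_ratio):
    """Closed form (4.29) for the one-loop running coupling."""
    return alpha_mu / (1.0 - b1 * alpha_mu * log_ratio)


def alpha_rk4(alpha_mu, b1, log_ratio, steps=1000):
    """Integrate d(alpha)/d(ln q^2) = b1 * alpha^2 numerically (classical RK4)."""
    h = log_ratio / steps
    a = alpha_mu
    for _ in range(steps):
        k1 = b1 * a ** 2
        k2 = b1 * (a + 0.5 * h * k1) ** 2
        k3 = b1 * (a + 0.5 * h * k2) ** 2
        k4 = b1 * (a + h * k3) ** 2
        a += h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return a


def alpha_leading_logs(alpha_mu, b1, log_ratio, n_max):
    """Series (4.22) truncated at n_max, keeping only f_n = (b1 * ln x)^n."""
    return sum(alpha_mu ** (n + 1) * (b1 * log_ratio) ** n
               for n in range(n_max + 1))


alpha_mu, b1, log_ratio = 1.0 / 137.0, 1.0 / (3.0 * math.pi), 24.0
print(alpha_one_loop(alpha_mu, b1, log_ratio))
print(alpha_rk4(alpha_mu, b1, log_ratio))
print(alpha_leading_logs(alpha_mu, b1, log_ratio, n_max=10))
```

The three numbers agree to high accuracy: the closed form solves the one-loop equation
(4.27) exactly, and the geometric series in $f_1'(1)\,\alpha(\mu^2)\ln x$ converges to it as long
as that combination is smaller than unity.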


Problem 4.7 Find the behavior of the running coupling constant in the two-loop
approximation. (The coefficient $f_1'(1)$ does not depend on the renormalization scheme,
while $f_2'(1)$, $f_3'(1)$ etc. can be scheme-dependent.)


In quantum electrodynamics the coefficient $f_1'(1)$ is positive and equals $1/3\pi$.
In this case the coupling $\alpha(q^2)$ increases as the charges get closer together ($q^2$
increases). This is a straightforward consequence of vacuum polarization. In fact, the
vacuum can be thought of as a kind of “dielectric medium” in which a negative charge
attracts positive charges and repels negative ones. As a result, the charge is
surrounded by a polarized “halo”, which screens it. Therefore, a negative charge,
observed from far away (small $q^2$), will be reduced by the charge of the surrounding
halo. At higher $q^2$ we approach the charge more closely, penetrating inside the
halo, and see a diminished screening of the charge.

In quantum chromodynamics we have one-loop diagrams of the kind shown in
Figure 4.3, where quarks and gluons should be substituted for the electrons and
photons respectively. They also give a positive contribution to $f_1'(1)$, proportional
to the number of possible diagrams of this kind and hence to the number of quark
flavors. As previously noted, gluons, unlike photons, are charged. Hence, in quantum
chromodynamics, in addition to the diagrams in Figure 4.3 there are also one-loop
diagrams with virtual gluon bubbles (Figure 4.4). Their contribution to $f_1'(1)$ is
negative and the number of possible diagrams of this kind is proportional to the
number of colors. For non-Abelian gauge theory with $f$ massless flavors and $n$
colors,

$$f_1'(1) = \frac{1}{12\pi}(2f - 11n). \tag{4.30}$$

Fig. 4.4.


Problem 4.8 Why does the fermion contribution to $f_1'(1)$ not depend on the number
of colors? Why is the gluon contribution proportional to the number of colors, but
not to the number of different gluons? Why do one-loop gluon diagrams, which are
due to the coupling $A^4$, not contribute to $f_1'(1)$?


The formula for the running coupling constant was derived in the limit when
fermion masses are negligible compared to $q$. It turns out that the contribution of
fermions with mass $m$ to $f_1'(1)$ becomes significant only when $q^2$ becomes larger
than $m^2$. Hence the $\beta$ function coefficients change by discrete amounts as quark
masses are crossed. In quantum chromodynamics, $f = 6$ for energies larger than
the top quark mass (~170 GeV) and $f = 5$ in the range 5 GeV $\ll q \ll$ 170 GeV. On
the other hand, since the number of colors is $n = 3$, $f_1'(1)$ is always less than zero.
This has far-reaching consequences. As follows from (4.29), the running “strong
fine structure constant”, $\alpha_s(q^2) \equiv g_s^2/4\pi$, decreases as $q^2$ increases. This is opposite
to the situation in electrodynamics. The strength of strong interactions decreases at
very high energies (small distances), so that $\alpha_s$ becomes much smaller than unity
and we can use perturbation theory to calculate highly energetic hadron processes.
The approximation we used to derive $\alpha_s(q^2)$ becomes more and more reliable as
$q^2$ grows and in the limit that $q^2 \to \infty$, interactions disappear. This property of
quantum chromodynamics is known as asymptotic freedom. The decrease in the
coupling constant is due to the gluon loops, which dominate over the fermion loop
contribution and thus lead to antiscreening of the colors. Quark colors are mainly
due to the polarized halo of gluons.

The normalization point $\mu^2$ in (4.29) is arbitrary, and the value of $\alpha_s(q^2)$ does
not depend on it. Introducing the physical scale $\Lambda_{QCD}$, defined by

$$\ln\frac{\Lambda_{QCD}^2}{\mu^2} = -\frac{12\pi}{(11n - 2f)\,\alpha_s(\mu^2)},$$

we can rewrite (4.29) for the running strong fine structure constant in terms of a
single parameter:

$$\alpha_s(q^2) = \frac{12\pi}{(11n - 2f)\ln(q^2/\Lambda_{QCD}^2)}. \tag{4.31}$$

Experimental data suggest that $\Lambda_{QCD}$ is about 220 MeV (to 10% accuracy). The
strong coupling constant $\alpha_s$ is 0.13 at $q \simeq 100$ GeV and increases to 0.21 when
the energy decreases to 10 GeV (in this energy range $f = 5$). According to (4.31),
the strength of strong interactions should become infinite at $q^2 = \Lambda_{QCD}^2$. However,
this is not more than an informed estimate consistent with the confinement
hypothesis. We should not forget that (4.31) was derived in the one-loop approximation
and is applicable only if $\alpha_s(q^2) \ll 1$, that is, at $q^2 \gg \Lambda_{QCD}^2$. At $q^2 \sim \Lambda_{QCD}^2$, all
loops give comparable contributions to the $\beta$ function and when $\alpha_s$ becomes of
order unity, (4.31) fails. To go further we have to apply nonperturbative methods,
for instance, numerical lattice calculations. These methods also strongly support
the idea of confinement.

Quantum chromodynamics is a quantitative theory only when we consider highly
energetic processes with $q \gg O(1)$ GeV. The strong force binding baryons in the
nuclei is a low-energy process and cannot be calculated perturbatively. It can only
be qualitatively explained as the result of collective multi-gluon and pion exchange.
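
The values of the strong coupling quoted for (4.31) can be reproduced with a few lines of
code. This is a minimal sketch under the assumptions stated in the text ($\Lambda_{QCD} = 220$ MeV,
$n = 3$ colors, $f = 5$ flavors between 10 and 100 GeV); the function name `alpha_s` and its
parameters are illustrative.

```python
import math


def alpha_s(q_gev, n_flavors, lambda_qcd_gev=0.22, n_colors=3):
    """One-loop running strong coupling, equation (4.31)."""
    log_term = math.log(q_gev ** 2 / lambda_qcd_gev ** 2)
    return 12.0 * math.pi / ((11 * n_colors - 2 * n_flavors) * log_term)


print(round(alpha_s(100.0, 5), 2))  # 0.13
print(round(alpha_s(10.0, 5), 2))   # 0.21
print(alpha_s(1.0, 3))              # no longer small: one-loop result unreliable
```

The last line shows how quickly the one-loop expression leaves the perturbative regime
as $q$ approaches $\Lambda_{QCD}$.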


4.2.2 Cosmological quark—gluon phase transition 


At high temperature and/or baryon density we can expect a transition from hadronic
matter to a quark–gluon plasma. In the very early universe at temperatures exceeding
$\Lambda_{QCD} \simeq 220$ MeV, the strong coupling $\alpha_s(T^2)$ is small and most quarks and gluons
only interact with each other weakly. They are no longer confined within particular
hadrons and their degrees of freedom are liberated. In this limit the quark–gluon
plasma consists of free noninteracting quarks and gluons, which can be described
in the ideal gas approximation. Of course, there always exist soft modes with
momenta $q^2 \leq \Lambda_{QCD}^2$, which can by no means be treated as noninteracting particles;
but at $T \gg \Lambda_{QCD}$ they constitute only a small fraction of the total energy density.
Baryon number is very small and therefore we can neglect the appropriate chemical
potentials. The contribution of the quark–gluon plasma to the total pressure is then

$$p_{qg} = \frac{\kappa_{qg}}{3} T^4 - B(T), \tag{4.32}$$

where the function $B(T)$ represents the correction due to the soft, low-energy modes
and

$$\kappa_{qg} = \frac{\pi^2}{30}\left(2 \times 8 + \frac{7}{8} \times 3 \times 2 \times 2 \times N_f\right). \tag{4.33}$$

The first term on the right hand side here accounts for the contribution of eight
gluons (with two polarizations each) and the second one is due to $N_f$ light quark
flavors with $m_q \ll T$ (every flavor has three colors, two polarization states and the
extra factor 2 accounts for antiquarks).

Unfortunately the correction term $B(T)$ cannot be calculated analytically from
first principles. To get an idea of how it may look, we can use a phenomenological
description of confinement, for instance, the MIT bag model. According to this
model, quarks and gluons are described by free fields inside bags (bounded regions
of space), identified with hadrons, and these fields vanish outside the bags. To
account for appropriate boundary conditions in a relativistically invariant way, one
adds to the Lagrangian “a cosmological constant” $B_0$ (called the bag constant),
which is assumed to vanish outside the bag. This “cosmological constant” induces
negative pressure and prevents quarks escaping from the bag. In a quark–gluon
plasma, where the bags “overlap”, $B(T) = B_0 = \text{const}$ everywhere.

Given the pressure, the energy density and entropy can be derived using the
thermodynamical relations (3.33) and (3.31):

$$s_{qg} = \frac{4}{3}\kappa_{qg} T^3 - \frac{\partial B}{\partial T}, \qquad \varepsilon_{qg} = \kappa_{qg} T^4 + B - T\frac{\partial B}{\partial T}. \tag{4.34}$$

As soon as the temperature drops below some critical value $T_c$, which on quite
general grounds is expected to be about $\Lambda_{QCD} \simeq 200$ MeV, most quarks and gluons
will be trapped and confined within the lightest hadrons, the pions ($\pi^0$, $\pi^\pm$). Their
masses are about 130 MeV and at the time of the phase transition they can still
be treated as ultra-relativistic particles. After quarks and gluons are captured, the
total number of degrees of freedom drops drastically, from 16 (for gluons) + $12N_f$
(for quarks) to only 3 for pions. The pressure and entropy density of the
ultra-relativistic pions are then

$$p_h = \frac{\kappa_h}{3} T^4, \qquad s_h = \frac{4}{3}\kappa_h T^3, \tag{4.35}$$

where $\kappa_h = \pi^2/10$.

Whether the transition from the quark–gluon plasma to hadronic matter is
characterized by truly singular behavior of the basic thermodynamical quantities or
their derivatives (first or second order phase transitions, respectively), or whether
it is merely a cross-over with rapid continuous change of these quantities, crucially
depends on the quark masses. A first order phase transition is usually related to
a discontinuous change of the symmetries characterizing the different phases. In
SU(3) pure gauge theory, without dynamical quarks, the expected first order phase
transition has been verified in numerical lattice calculations. In the case of two 
quark flavors one expects a second order phase transition (continuous change of 
symmetry). In the limit of three massless quarks, again for reasons of symmetry, 
we expect a first order transition. When the quark masses do not vanish, the appro- 
priate symmetries can be broken explicitly and a cross-over is expected. This is the 
situation most likely realized in nature: of the three quarks relevant for dynamics, 
two (u, d) are very light and one (s) is relatively heavy. However, despite more 
than 20 years of efforts, the character of the cosmological quark—gluon transition 
has not yet been firmly established. This is due to the great difficulty of computing 
with light dynamical quarks on the lattice. The possibility of a true phase transition, 
therefore, has not been ruled out. 

Irrespective of the nature of the transition, there is a very sharp change in the
energy density and entropy in the narrow temperature interval around $T_c$. This
result is confirmed in lattice calculations and clearly indicates the liberation of the
quark degrees of freedom. A first order phase transition is the most interesting for
cosmology and therefore we briefly discuss it, assuming $B(T) = B_0 = \text{const}$, as in
the bag model. This reproduces the bulk features of the equation of state obtained
in numerical lattice calculations. The pressure and entropy density as functions of
temperature are shown in Figure 4.5. At $T_c$, even if the phase transition is first order,
the pressure should be continuous, allowing both phases (hadrons and quark–gluon
plasma) to coexist. Hence, equating (4.32) and (4.35) at $T = T_c$, we can express
the critical temperature $T_c$ through the bag constant $B_0$:

$$T_c = \left(\frac{3B_0}{\kappa_{qg} - \kappa_h}\right)^{1/4} = \left(\frac{180}{(26 + 21N_f)\pi^2}\right)^{1/4} B_0^{1/4}. \tag{4.36}$$

For $B_0^{1/4} \simeq 220$ MeV and for $N_f = 3$ light quark flavors, $T_c \simeq 150$ MeV.

Fig. 4.5.
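
The estimate of the critical temperature follows from the counting of degrees of freedom
in (4.33) and (4.35). The sketch below evaluates (4.36) under the bag-model assumption
$B(T) = B_0$; the function names are illustrative and the input $B_0^{1/4} = 220$ MeV is the
value quoted in the text.

```python
import math

KAPPA_H = math.pi ** 2 / 10.0  # three ultra-relativistic pion states, eq. (4.35)


def kappa_qg(n_flavors):
    """Coefficient (4.33): gluons plus quarks weighted by the fermionic factor 7/8."""
    gluons = 2 * 8                                  # two polarizations, eight gluons
    quarks = (7.0 / 8.0) * 3 * 2 * 2 * n_flavors    # colors, spins, antiquarks
    return math.pi ** 2 / 30.0 * (gluons + quarks)


def critical_temperature(bag_quarter_mev, n_flavors):
    """T_c in MeV from pressure equality p_qg(T_c) = p_h(T_c), eq. (4.36)."""
    b0 = bag_quarter_mev ** 4
    return (3.0 * b0 / (kappa_qg(n_flavors) - KAPPA_H)) ** 0.25


print(round(critical_temperature(220.0, 3)))  # 148
```

The result, about 148 MeV, is the value rounded to 150 MeV in the text. Because $T_c$
scales as $B_0^{1/4}$, it changes linearly with the assumed value of $B_0^{1/4}$.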


Problem 4.9 How should (4.32) be modified if the baryon number is different from
zero? Using the condition $p_{qg}(T_c, \mu_B) \simeq 0$, where $\mu_B$ is the baryon’s chemical
potential, as an approximate criterion for the phase transition, draw in the $T_c$–$\mu_B$
plane the shape of the transition line separating the hadron and quark–gluon phases.
Why does the above criterion give us a good estimate?


In the case of a first order phase transition, the entropy density is discontinuous
at the transition and its jump, $\Delta s_{qg} = (4/3)(\kappa_{qg} - \kappa_h) T_c^3$, is directly proportional to
the change in the number of active degrees of freedom. A first order phase transition
occurs via the formation of bubbles of hadronic phase in the quark–gluon plasma.
As the universe expands, these bubbles take up more and more space and when what
is left is mainly the hadronic phase the transition is over. During a first order phase
transition the temperature is strictly constant and is equal to $T_c$. The released latent
heat, $\Delta\varepsilon = T_c \Delta s_{qg}$, keeps the temperature of the radiation and leptons unchanged
in spite of expansion. To estimate the duration of the transition one can use the
conservation law for the total entropy.


Problem 4.10 Taking into account that in the quantum chromodynamics epoch, in 
addition to quarks and gluons, there are photons, three flavors of neutrinos, electrons 
and muons, verify that the scale factor increases by a factor of about 1.5 during the 
phase transition. 
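
The factor of about 1.5 can be checked with a short calculation. The sketch below is an
illustration under explicit assumptions that are not spelled out in the problem: $N_f = 3$
light quark flavors, a constant bag term (so that it drops out of the entropy), only
left-handed neutrinos, and all listed leptons treated as relativistic. At constant
temperature $T_c$, conservation of the total entropy $s a^3$ then fixes the growth of the
scale factor from the ratio of effective degrees of freedom.

```python
def g_quark_gluon(n_flavors):
    """Effective degrees of freedom of the quark-gluon plasma, as in (4.33)."""
    return 2 * 8 + (7.0 / 8.0) * 3 * 2 * 2 * n_flavors


def g_light():
    """Photons, three neutrino flavors, electrons and muons (with antiparticles)."""
    photons = 2
    neutrinos = (7.0 / 8.0) * 3 * 2          # one helicity state, nu and anti-nu
    electrons_muons = (7.0 / 8.0) * 2 * 4    # two species, two spins, antiparticles
    return photons + neutrinos + electrons_muons


def scale_factor_growth(n_flavors, g_hadrons=3):
    """a_end / a_start from s * a^3 = const at fixed temperature T_c."""
    before = g_quark_gluon(n_flavors) + g_light()
    after = g_hadrons + g_light()
    return (before / after) ** (1.0 / 3.0)


print(round(scale_factor_growth(3), 2))  # 1.53
```

With two light flavors instead of three the same estimate gives a somewhat smaller
factor, so the quoted value of 1.5 depends mildly on the assumed quark content.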


If the transition is of second order or a cross-over, the entropy is a continuous 
function of temperature, which changes very sharply in the vicinity of $T_c$. As the
universe expands the temperature always drops, but during the transition it remains 
nearly constant. For the case of a cross-over transition, the notion of phase is not 
defined during the transition. 

As we have already mentioned, only a first order quantum chromodynamics phase 
transition has interesting cosmological consequences. This is due to its “violent 
nature.” In particular, it could lead to inhomogeneities in the baryon distribution 
and hence influence nucleosynthesis. However, calculations show that this effect 
is too small to be relevant. There could be other, more speculative consequences,
for instance, the formation of quark nuggets, the generation of magnetic fields
and gravitational waves, black hole formation, etc. These are still the subjects of
investigation. At present, however, it is unlikely that a quantum chromodynamics
phase transition leaves an observationally important “imprint.” It looks like an
interesting but rather “silent” epoch in the evolution of the universe.


4.3 Electroweak theory 


The most familiar weak interaction process is neutron decay: $n \to p\, e^- \bar{\nu}_e$. In Section
3.5 we described it using the Fermi four-fermion interaction theory, which is very
successful at low energies. However, this theory is not self-consistent because it
is not renormalizable and violates unitarity (conservation of probability) at high
energies (above 300 GeV). The Fermi constant $G_F$, characterizing the strength
of the weak interactions, has dimension of inverse mass squared. Therefore, it is
natural to assume that the four-fermion vertex is simply the low-energy limit of
a diagram made of three-legged vertices with dimensionless coupling $g_w$, which
describes the exchange of a massive vector boson $W$ (Figure 4.6).

At energies much smaller than the mass of the intermediate boson, one can
replace the boson propagator with $1/M_W^2$, and the diagram shrinks to the four-legged
diagram with effective coupling constant $G_F = O(1)\, g_w^2/M_W^2$. Noting the vectorial
nature of the weak interactions, we are led to a description using gauge symmetries.
However, an obstacle immediately arises. We have noted above that gauge bosons
should be massless because the mass term spoils gauge invariance, renormalizability
and unitarity. This problem was finally resolved by the renormalizable standard
electroweak theory, where the masses of all particles (including intermediate gauge
bosons) emerge as a result of their interaction with a classical scalar field. The
electroweak theory is based on a unification (or, more precisely, on a “mixing”)
of electromagnetic and weak interactions within a model based on SU(2) × U(1)
gauge symmetry. The gauge coupling constants of the SU(2) and U(1) groups
should be taken as independent, and therefore the SU(2) × U(1) group cannot be
“unified” in a single U(2) group.

Fig. 4.6.
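
The orders of magnitude in the discussion of the Fermi constant can be made concrete.
The following sketch is an illustration, not part of the original text. It assumes the
measured values $G_F \approx 1.166 \times 10^{-5}$ GeV$^{-2}$ and $M_W \approx 80.4$ GeV, neither of which is
quoted in this section, and it fixes the unspecified $O(1)$ factor by the commonly used
normalization $G_F/\sqrt{2} = g_w^2/8M_W^2$.

```python
import math

G_FERMI = 1.166e-5   # GeV^-2, measured value (assumption: not quoted in the text)
M_W = 80.4           # GeV, measured W mass (assumption: not quoted in the text)


def fermi_scale_gev(g_fermi=G_FERMI):
    """Energy scale G_F^(-1/2) at which the four-fermion theory must break down."""
    return g_fermi ** -0.5


def weak_coupling(g_fermi=G_FERMI, m_w=M_W):
    """Dimensionless coupling from G_F / sqrt(2) = g_w^2 / (8 M_W^2)."""
    return math.sqrt(8.0 * m_w ** 2 * g_fermi / math.sqrt(2.0))


print(round(fermi_scale_gev()))      # 293
print(round(weak_coupling(), 2))     # 0.65
```

The scale $G_F^{-1/2} \approx 293$ GeV matches the energy of about 300 GeV at which the Fermi theory
violates unitarity, and the resulting $g_w$ is of order unity, so the weakness of the
interaction at low energies is due to the large boson mass rather than a small coupling.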


4.3.1 Fermion content 


In contrast to quarks, leptons are not involved in strong interactions, but both
leptons and quarks participate in weak interactions. The three electrically charged
leptons, the electron $e$, the muon $\mu$, and the $\tau$-lepton, are partnered by neutrinos
$\nu_e$, $\nu_\mu$ and $\nu_\tau$ respectively.

The neutrino masses are very small and we will first consider them as if they were 
massless. The neutrino has spin 1/2 and, in principle, the normalized component 
of spin in the direction of motion, called the helicity, can take the value +1 or —1. 
It has been found in experiments, however, that all neutrinos are left-handed: they 
have helicity −1, that is, their spins are always directed antiparallel to their
velocity. All antineutrinos are right-handed. Hence, in weak interactions, the symmetry
between right- and left-handedness (parity $P$) is broken and the corresponding
theory is chiral. Note that the notion of helicity is Lorentz-invariant only for massless
particles which move with the speed of light, otherwise one can always go to a 
frame of reference moving faster than the particle and change its helicity. 

The quarks and leptons are massive. However, because of the chiral nature of the 
theory, mass terms cannot be introduced directly without spoiling gauge invariance. 
In electroweak theory the masses arise as a result of interaction with a classical scalar 
field. They will be considered later; until then, we will treat all fermions as if they 
were massless particles. 

In weak interactions, the charged leptons can be converted into their correspond- 
ing electrically neutral neutrinos. As a consequence the intermediate vector boson 
must carry electric charge. Its antiparticle has the opposite charge and hence there 
should be at least two gauge bosons responsible for weak interactions. The simplest 
gauge group which can incorporate them is the SU(2) group. 

Only the left-handed electron $e_L$ can be converted into the left-handed neutrino
$\nu_e$. They form an SU(2) doublet and transform as

$$\psi_L \equiv \begin{pmatrix} \nu_e \\ e \end{pmatrix}_L \to U\psi_L, \tag{4.37}$$

where $U$ is a unitary 2 × 2 matrix with $\det U = 1$, and $\nu_L$ and $e_L$ are Dirac spinors
describing the massless left-handed neutrino and electron. The right-handed electron
is a singlet with respect to the SU(2) group: $\psi_R \equiv e_R \to \psi_R$. The concrete
form of Dirac spinors for chiral states depends on the Dirac matrix representation
used. For instance, in the chiral representation the left-handed fermions are
described by four component spinors with the first two components equal to zero.
To make concrete calculations of processes, the reader should be familiar with the
standard algebra of Dirac matrices, which can be found in any book on field theory.
We will not need it here.

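Other leptons also come in doublets and singlets:

$$\begin{pmatrix} \nu_\mu \\ \mu \end{pmatrix}_L, \ \mu_R; \qquad \begin{pmatrix} \nu_\tau \\ \tau \end{pmatrix}_L, \ \tau_R. \tag{4.38}$$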


The three different generations of leptons have very similar properties. Because 
weak interactions convert particles only within a particular generation, the lepton 
numbers are conserved separately. 

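The six quark flavors also form three generations under weak interaction:

$$\begin{pmatrix} u \\ d' \end{pmatrix}_L, \ u_R, \ d_R; \qquad \begin{pmatrix} c \\ s' \end{pmatrix}_L, \ c_R, \ s_R; \qquad \begin{pmatrix} t \\ b' \end{pmatrix}_L, \ t_R, \ b_R. \tag{4.39}$$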


We have skipped the color indices which are irrelevant for electroweak interactions. 
The flavors $d'$, $s'$, $b'$ entering the doublets are linear superpositions of the flavors
$d$, $s$, $b$, conserved in strong interactions. As a consequence, the weak interactions
violate all flavor conservation laws. 

Since the individual Lagrangians for each generation have the same form, the 
fermionic part of electroweak Lagrangian is obtained essentially by replication 
of the Lagrangian for one particular lepton generation. Therefore we consider, for 
example, only the electron and its corresponding neutrino. However, the importance 
of the quark generations should not be underestimated. The anomalies which would 
spoil renormalizability are canceled only if the number of quark generations is equal 
to the number of lepton generations. 

The SU(2) group has three gauge bosons. As we have already mentioned, two of 
them are responsible for the charged weak interaction. The third boson is electrically 
neutral since only then is it its own antiparticle. However, it cannot be identified 
with the photon, because the photon should be an Abelian U(1) gauge boson. 
Because one of the partners in the doublet (4.37) is electrically charged, it makes
sense to try to incorporate both the electromagnetic and weak interactions into the
SU(2) × U(1) group. The corresponding Lagrangian

$$\mathcal{L}_f = i\bar{\psi}_L \gamma^\mu \left(\partial_\mu + ig\mathbf{A}_\mu + ig' Y_L B_\mu\right)\psi_L + i\bar{\psi}_R \gamma^\mu \left(\partial_\mu + ig' Y_R B_\mu\right)\psi_R \tag{4.40}$$

is invariant under both SU(2) transformations,

$$\psi_L \to U\psi_L, \qquad \psi_R \to \psi_R,$$

and U(1) transformations,

$$\psi_L \to e^{-ig' Y_L \lambda(x)}\psi_L, \qquad \psi_R \to e^{-ig' Y_R \lambda(x)}\psi_R,$$

if the gauge fields $\mathbf{A}_\mu$ and $B_\mu$ transform according to (4.11) and (4.5) respectively.
The U(1) hypercharges $Y_L$ and $Y_R$ can be different for the right- and left-handed
electrons. The only requirement is that they should be able to reproduce the correct
values of the observed electric charges.

In electroweak theory, three out of four gauge bosons should acquire masses and 
one boson should remain massless. Additionally, the fermions should become mas- 
sive. These masses can be generated in a soft way via interaction with the classical 
scalar field. In this case the theory remains gauge-invariant and renormalizable. 
To demonstrate how this mechanism works, let us first consider the simplest U(1) 
Abelian gauge field which interacts with a complex scalar field. 


4.3.2 “Spontaneous breaking” of U(1) symmetry 
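The Lagrangian

$$\mathcal{L} = \frac{1}{2}\left((\partial^\mu + ieA^\mu)\varphi\right)^* \left((\partial_\mu + ieA_\mu)\varphi\right) - V(\varphi^*\varphi) - \frac{1}{4}F^2(A), \tag{4.41}$$

where $F^2 \equiv F_{\mu\nu}F^{\mu\nu}$, is invariant under the gauge transformations

$$\varphi \to e^{-ie\lambda}\varphi, \qquad A_\mu \to A_\mu + \partial_\mu\lambda.$$

For $\varphi \neq 0$, we can parameterize the complex scalar field $\varphi$ by two real scalar fields
$\chi$ and $\zeta$, defined via

$$\varphi = \chi \exp(ie\zeta). \tag{4.42}$$

The field $\chi$ is gauge-invariant and the field $\zeta$ transforms as $\zeta \to \zeta - \lambda$. We can
combine the fields $A_\mu$ and $\zeta$ to form the gauge-invariant variable

$$G_\mu \equiv A_\mu + \partial_\mu\zeta. \tag{4.43}$$

Lagrangian (4.41) can then be rewritten entirely in terms of the gauge-invariant
fields $\chi$ and $G_\mu$:

$$\mathcal{L} = \frac{1}{2}\partial^\mu\chi\,\partial_\mu\chi - V(\chi^2) - \frac{1}{4}F^2(G) + \frac{e^2}{2}\chi^2 G^\mu G_\mu. \tag{4.44}$$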

If the potential $V$ has a minimum at $\chi_0 = \text{const} \neq 0$, we can consider small
perturbations around this minimum, $\chi = \chi_0 + \phi$, and expand the Lagrangian in powers
of $\phi$. It then describes the real scalar field $\phi$ of mass $m_H = \sqrt{V_{,\chi\chi}(\chi_0)}$, which
interacts with the massive vector field $G_\mu$. The mass of the vector field is $M_G = e\chi_0$. If
Lagrangian (4.41) is renormalizable, one expects that after rewriting it in explicitly
gauge-invariant form, it will remain renormalizable. This is what really happens in
spite of the fact that the vector field acquires mass. If $\chi_0 \neq 0$, the physical fields
corresponding to observable particles are the gauge-invariant real scalar field (Higgs
field) and the massive vector field. Of course, after we have rewritten the Lagrangian
in terms of the new variables, the total number of physical degrees of freedom does
not change. In fact, the system described by (4.41) has four degrees of freedom per
point in space: namely, two for the complex scalar field and two corresponding to
the transverse components of the vector field. The Lagrangian expressed in terms
of gauge-invariant variables describes a real scalar field with one degree of freedom
and a massive vector field with three degrees of freedom.
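
The step from (4.41) to (4.44) rests on the identity
$|(\partial_\mu + ieA_\mu)\varphi|^2 = (\partial_\mu\chi)^2 + e^2\chi^2 G_\mu^2$ for each component $\mu$. The sketch
below checks it numerically in one dimension with finite differences. It is only an
illustration: the profiles chosen for $\chi$, $\zeta$ and $A$ and the value of $e$ are arbitrary
assumptions, and the function names are not from the text.

```python
import cmath
import math

E = 0.3  # gauge coupling e (arbitrary illustrative value)


def chi(x):
    return 1.5 + 0.2 * math.sin(x)   # modulus field, kept away from zero


def zeta(x):
    return 0.7 * x ** 2              # phase field


def a_field(x):
    return math.cos(2.0 * x)         # one component of the gauge field A


def derivative(f, x, h=1e-5):
    """Central finite difference; works for real and complex valued f."""
    return (f(x + h) - f(x - h)) / (2.0 * h)


def phi(x):
    return chi(x) * cmath.exp(1j * E * zeta(x))   # parameterization (4.42)


def kinetic_original(x):
    """(1/2) |(d + i e A) phi|^2, the kinetic term of (4.41)."""
    d_phi = derivative(phi, x) + 1j * E * a_field(x) * phi(x)
    return 0.5 * abs(d_phi) ** 2


def kinetic_gauge_invariant(x):
    """(1/2) (d chi)^2 + (e^2 / 2) chi^2 G^2 with G = A + d zeta, as in (4.44)."""
    g = a_field(x) + derivative(zeta, x)
    return 0.5 * derivative(chi, x) ** 2 + 0.5 * E ** 2 * chi(x) ** 2 * g ** 2


for x in (-1.0, 0.4, 2.5):
    print(kinetic_original(x), kinetic_gauge_invariant(x))
```

The two columns agree up to the finite-difference error. The second term of the
gauge-invariant form, evaluated at $\chi = \chi_0$, is the mass term $\frac{1}{2}M_G^2 G^\mu G_\mu$ with $M_G = e\chi_0$.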

This method of generating the mass term is known as the Higgs mechanism and its 
main advantage is that it does not spoil renormalizability. The vector field acquires 
mass and its longitudinal degree of freedom becomes physical at the expense of 
a classical scalar field. Of course, (4.44) is invariant with respect to the original
gauge transformations which become trivial: $\chi \to \chi$ and $G_\mu \to G_\mu$. However, if
one tries to interpret the gauge-invariant field $G_\mu$ as a gauge field like $A_\mu$, then one
erroneously concludes that gauge invariance is gone. This is why one often says that
the symmetry is spontaneously broken. Such a statement is somewhat misleading
but we will nevertheless use this widely accepted, standard terminology.

The gauge-invariant variables can be introduced and interpreted as physical
degrees of freedom only if $\chi_0 \neq 0$ and only when the perturbations around $\chi_0$ are
small, so that $\chi = \chi_0 + \phi \neq 0$ everywhere in space. Otherwise (4.42) becomes
singular at $\chi = 0$ and the fields $\chi$ and $\zeta$ used to construct the gauge-invariant
variables become ill-defined. In the case $\chi_0 = 0$, one has to work directly with the
Lagrangian in its original form (4.41).


4.3.3 Gauge bosons 


In electroweak theory, the masses of the gauge bosons can also be generated 
using the Higgs mechanism. Let us consider the SU(2) x U(1) gauge-invariant 
Lagrangian 


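$$\mathcal{L}_\varphi = \frac{1}{2}(D^\mu\varphi)^\dagger(D_\mu\varphi) - V(\varphi^\dagger\varphi), \tag{4.45}$$

where $\varphi$ is an SU(2) doublet of complex scalar fields, $\dagger$ denotes the Hermitian
conjugation and

$$D_\mu \equiv \partial_\mu + ig\mathbf{A}_\mu - \frac{i}{2}g' B_\mu. \tag{4.46}$$

We have assumed here that the hypercharge of the scalar doublet is $Y_\varphi = -1/2$,
so that under the U(1) group it transforms as $\varphi \to e^{(i/2)g'\lambda}\varphi$. The scalar field can be
written as

$$\varphi = \chi\begin{pmatrix} \zeta_1 \\ \zeta_2 \end{pmatrix} \equiv \chi\begin{pmatrix} \zeta_2^* & \zeta_1 \\ -\zeta_1^* & \zeta_2 \end{pmatrix}\begin{pmatrix} 0 \\ 1 \end{pmatrix} \equiv \chi\boldsymbol{\zeta}\varphi_0, \tag{4.47}$$

where $\chi$ is a real field and $\zeta_1$, $\zeta_2$ are two complex scalar fields satisfying the condition
$|\zeta_1|^2 + |\zeta_2|^2 = 1$. The definition of the SU(2) matrix $\boldsymbol{\zeta}$ and the constant vector $\varphi_0$
can easily be read off the last equality. Substituting $\varphi = \chi\boldsymbol{\zeta}\varphi_0$ in (4.45), we obtain

$$\mathcal{L}_\varphi = \frac{1}{2}\partial^\mu\chi\,\partial_\mu\chi - V(\chi^2) + \frac{\chi^2}{2}\varphi_0^\dagger\left(g\mathbf{G}_\mu - \frac{1}{2}g' B_\mu\right)\left(g\mathbf{G}^\mu - \frac{1}{2}g' B^\mu\right)\varphi_0, \tag{4.48}$$

where

$$\mathbf{G}_\mu \equiv \boldsymbol{\zeta}^{-1}\mathbf{A}_\mu\boldsymbol{\zeta} - \frac{i}{g}\boldsymbol{\zeta}^{-1}\partial_\mu\boldsymbol{\zeta} \tag{4.49}$$

are SU(2) gauge-invariant variables.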


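Problem 4.11 Consider the SU(2) transformation

$$\begin{pmatrix} \zeta_1 \\ \zeta_2 \end{pmatrix} \to \begin{pmatrix} \tilde{\zeta}_1 \\ \tilde{\zeta}_2 \end{pmatrix} \equiv U\begin{pmatrix} \zeta_1 \\ \zeta_2 \end{pmatrix}, \tag{4.50}$$

accompanied by the U(1) transformation $\varphi \to e^{(i/2)g'\lambda}\varphi$ and verify that

$$\boldsymbol{\zeta} \to \begin{pmatrix} e^{-(i/2)g'\lambda}\tilde{\zeta}_2^* & e^{(i/2)g'\lambda}\tilde{\zeta}_1 \\ -e^{-(i/2)g'\lambda}\tilde{\zeta}_1^* & e^{(i/2)g'\lambda}\tilde{\zeta}_2 \end{pmatrix} = U\boldsymbol{\zeta}E, \tag{4.51}$$

where

$$E \equiv \begin{pmatrix} e^{-(i/2)g'\lambda} & 0 \\ 0 & e^{(i/2)g'\lambda} \end{pmatrix}. \tag{4.52}$$

(Hint: Note that an arbitrary SU(2) matrix has the same form as matrix $\boldsymbol{\zeta}$ with $\zeta_1$, $\zeta_2$
replaced by some complex numbers $\alpha$, $\beta$.)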

Using this result, it is easy to see that the field $G_\mu$ is SU(2) gauge-invariant, that
is,

$$G_\mu \to G_\mu \tag{4.53}$$

as

$$\zeta \to U\zeta, \qquad A_\mu \to U A_\mu U^{-1} + (i/g)\,(\partial_\mu U)\,U^{-1}.$$


Thus, we have rewritten our original Lagrangian (4.45) in terms of SU(2) gauge-invariant
variables $\chi$, $G_\mu$, $B_\mu$.

The fields $B_\mu$ and $G_\mu$ change under U(1) transformations. Field $B_\mu$ transforms
as


$$B_\mu \to B_\mu + \partial_\mu\lambda. \tag{4.54}$$


Taking into account that $A_\mu \to A_\mu$ and $\zeta \to \zeta E$, where matrix $E$ is defined in
(4.52), we find that


$$G_\mu \to E^{-1} G_\mu E - \frac{i}{g}\,E^{-1}\partial_\mu E \tag{4.55}$$


under the U(1) transformations.
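Matrix $G_\mu$ is the Hermitian traceless matrix


$$G_\mu \equiv \frac{1}{2} \begin{pmatrix} -G^3_\mu & \sqrt{2}\,W^+_\mu \\ \sqrt{2}\,W^-_\mu & G^3_\mu \end{pmatrix}, \tag{4.56}$$


where $W^\pm_\mu$ are a conjugate pair of complex vector fields and $G^3_\mu$ is a real vector
field. In parameterizing matrix $G_\mu$ we have used the standard sign convention and
normalization adopted in the literature. Substituting this expression in (4.48) and
replacing fields $G^3_\mu$ and $B_\mu$ with the "orthogonal" linear combinations $Z_\mu$ and $A_\mu$,


$$\begin{pmatrix} A_\mu \\ Z_\mu \end{pmatrix} = \begin{pmatrix} \cos\theta_w & \sin\theta_w \\ -\sin\theta_w & \cos\theta_w \end{pmatrix} \begin{pmatrix} B_\mu \\ G^3_\mu \end{pmatrix}, \tag{4.57}$$


where $\theta_w$ is the Weinberg angle and


$$\cos\theta_w = \frac{g}{\sqrt{g^2 + g'^2}}, \tag{4.58}$$

we can rewrite (4.48) in the following form:

$$\mathcal{L}_\varphi = \frac{1}{2}\,\partial^\mu\chi\,\partial_\mu\chi - V(\chi^2) + \frac{(g^2 + g'^2)\,\chi^2}{8}\,Z_\mu Z^\mu + \frac{g^2\chi^2}{4}\,W^+_\mu W^{-\mu}. \tag{4.59}$$

Because

$$\mathrm{tr}\,F^2(A) = \mathrm{tr}\,F^2(G),$$

where $F^2 \equiv F_{\mu\nu}F^{\mu\nu}$, the Lagrangian for the gauge fields is

$$\mathcal{L}_F = -\frac{1}{4}\,F^2(B) - \frac{1}{2}\,\mathrm{tr}\,F^2(G). \tag{4.60}$$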


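Problem 4.12 Substituting (4.56) in (4.60) and using the definitions (4.13) and
(4.57), verify that (4.60) can be rewritten as


$$\mathcal{L}_F = -\frac{1}{4}\,F^2(A) - \frac{1}{4}\,F^2(Z) - \frac{1}{2}\,F_{\mu\nu}(W^+)\,F^{\mu\nu}(W^-), \tag{4.61}$$

where

$$\begin{aligned} F_{\mu\nu}(A) &\equiv \partial_\mu A_\nu - \partial_\nu A_\mu + i g \sin\theta_w\,(W^-_\mu W^+_\nu - W^+_\mu W^-_\nu), \\ F_{\mu\nu}(Z) &\equiv \partial_\mu Z_\nu - \partial_\nu Z_\mu + i g \cos\theta_w\,(W^-_\mu W^+_\nu - W^+_\mu W^-_\nu), \end{aligned} \tag{4.62}$$

and

$$F_{\mu\nu}(W^\pm) \equiv D^\pm_\mu W^\pm_\nu - D^\pm_\nu W^\pm_\mu, \qquad D^\pm_\mu \equiv \partial_\mu \mp i g \sin\theta_w\,A_\mu \mp i g \cos\theta_w\,Z_\mu. \tag{4.63}$$


The terms which are third and fourth order in the field strength describe the
interactions of the gauge fields. Draw the corresponding vertices.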


We now turn to the renormalizable scalar field potential


$$V(\chi^2) = \frac{\lambda}{4}\,(\chi^2 - \chi_0^2)^2. \tag{4.64}$$


In this case, the field $\chi$ acquires a vacuum expectation value $\chi_0$, corresponding
to the minimum of this potential. Let us consider small perturbations around this
minimum, so that $\chi = \chi_0 + \phi$. It is obvious that the Lagrangian $\mathcal{L}_\varphi + \mathcal{L}_F$ then
describes the Higgs scalar field $\phi$ of mass


$$m_H = \sqrt{V_{,\chi\chi}(\chi_0)} = \sqrt{2\lambda}\,\chi_0, \tag{4.65}$$


massive vector fields $Z_\mu$ and $W^\pm_\mu$, with masses


$$M_Z = \frac{\sqrt{g^2 + g'^2}}{2}\,\chi_0, \qquad M_{W^\pm} = M_Z \cos\theta_w, \tag{4.66}$$


and the massless field $A_\mu$. This massless field is responsible for long-range
interactions and should be identified with the electromagnetic field.


Problem 4.13 Using (4.55), verify that under U(1) transformations


$$W^\pm_\mu \to e^{\pm i g'\lambda}\,W^\pm_\mu, \qquad A_\mu \to A_\mu + \frac{1}{\cos\theta_w}\,\partial_\mu\lambda, \qquad Z_\mu \to Z_\mu. \tag{4.67}$$


Thus we see that $W^\pm_\mu$ transform as electrically charged fields. Comparing these
transformation laws with those of electrodynamics, we can identify the electric
charge of the $W^\pm$ bosons (see also (4.63)) as


$$e = g' \cos\theta_w = g \sin\theta_w. \tag{4.68}$$


The boson $Z_\mu$ is electrically neutral. As we will see, $W$ and $Z$ bosons are responsible
for the charged and neutral weak interactions respectively, and the "weakness" of
these interactions is due to the large masses of the intermediate bosons rather than
the smallness of the weak coupling constant $g$. It follows from (4.68) that the "weak
fine structure constant"


$$\alpha_w \equiv \frac{g^2}{4\pi} = \frac{\alpha}{\sin^2\theta_w} \tag{4.69}$$


is in fact larger than $\alpha \equiv e^2/4\pi \simeq 1/137$.


4.3.4 Fermion interactions 


Combining the left-handed doublet $\psi_L$ with the scalar field, we can easily build
the corresponding fermionic SU(2) gauge-invariant variables:


$$\Psi_L = \zeta^{-1}\psi_L. \tag{4.70}$$


The right-handed electron, $\psi_R \equiv e_R$, is a singlet with respect to the SU(2) group.
Because under U(1) transformations $\psi_L \to e^{-i g' Y_L \lambda}\psi_L$ and $\zeta \to \zeta E$, we obtain


$$\Psi_L \to e^{-i g' Y_L \lambda}\,E^{-1}\,\Psi_L.$$


Defining the SU(2) gauge-invariant left-handed electron and neutrino as


$$\Psi_L \equiv \begin{pmatrix} \nu_L \\ e_L \end{pmatrix}, \tag{4.71}$$


we have


$$\nu_L \to e^{-i g' \left(Y_L - \frac{1}{2}\right)\lambda}\,\nu_L, \qquad e_L \to e^{-i g' \left(Y_L + \frac{1}{2}\right)\lambda}\,e_L, \qquad e_R \to e^{-i g' Y_R \lambda}\,e_R. \tag{4.72}$$

The neutrino has no electric charge and therefore should not transform. Hence the
hypercharge of the left-handed doublet should be taken to be $Y_L = 1/2$. In this case
the left-handed electron transforms as $e_L \to e^{-i g'\lambda} e_L$. To ensure the same value for
the electric charges of the right- and left-handed electrons we have to put $Y_R = 1$.
Taking into account the transformation law for the vector potential $A_\mu$ (see (4.67)),
we conclude that the electric charge of the electron is equal to $e$ given in (4.68).

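Substituting $\psi_L = \zeta\Psi_L$ in (4.40) and using definition (4.49), we can rewrite
the Lagrangian for fermions in terms of the gauge-invariant variables:


$$\mathcal{L}_f = i\bar\Psi_L \gamma^\mu \left(\partial_\mu + i g G_\mu + \frac{i}{2} g' B_\mu\right)\Psi_L + i\bar\psi_R \gamma^\mu \left(\partial_\mu + i g' B_\mu\right)\psi_R. \tag{4.73}$$

Alternatively, using definitions (4.56), (4.57) and (4.71), we obtain

$$\begin{aligned} \mathcal{L}_f ={}& i\left(\bar e \gamma^\mu \partial_\mu e + \bar\nu_L \gamma^\mu \partial_\mu \nu_L\right) - e\,(\bar e \gamma^\mu e)\,A_\mu - \frac{g}{\sqrt{2}}\left((\bar\nu_L \gamma^\mu e_L)\,W^+_\mu + (\bar e_L \gamma^\mu \nu_L)\,W^-_\mu\right) \\ &+ \left[\frac{\sin^2\theta_w}{\cos\theta_w}\,g\,(\bar e_R \gamma^\mu e_R) - \frac{\cos 2\theta_w}{2\cos\theta_w}\,g\,(\bar e_L \gamma^\mu e_L) + \frac{1}{2\cos\theta_w}\,g\,(\bar\nu_L \gamma^\mu \nu_L)\right] Z_\mu, \end{aligned} \tag{4.74}$$

where the well known properties of the Dirac matrices have been used to write
$\bar e_R \gamma^\mu e_R + \bar e_L \gamma^\mu e_L = \bar e \gamma^\mu e$.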


The first cubic term is the familiar electromagnetic interaction and the next terms
describe the charged and neutral weak interactions due to the exchange of $W^\pm$
and $Z$ bosons respectively. Note that the right-handed electrons participate only in
electromagnetic and neutral weak interactions, and not in the charged interactions.

Replacing $e$, $\nu_e$ in (4.74) by $\mu$, $\nu_\mu$ / $\tau$, $\nu_\tau$, we obtain the Lagrangian for the
second/third generation of leptons. Let us consider muon decay. The appropriate tree
diagram is shown in Figure 4.7 (the reader must take care to correctly identify the
notation used for wave functions and particles in diagrams; for example, the conjugated
wave function $\bar\nu$ can describe a neutrino as well as an antineutrino depending on the
orientation of the diagram). Because the muon mass is much smaller than the mass of
the $W$ boson, the boson propagator can be replaced by $i g_{\mu\nu}/M_W^2$ and the diagram in
Figure 4.7 reduces to the four-fermion diagram corresponding to the coupling term


$$2\sqrt{2}\,G_F\,(\bar\nu_\mu \gamma^\alpha \mu_L)(\bar e_L \gamma_\alpha \nu_e),$$


Fig. 4.7.


where

$$G_F = \frac{g^2}{4\sqrt{2}\,M_W^2} = \frac{\pi\alpha_w}{\sqrt{2}\,M_W^2} \simeq 1.166 \times 10^{-5}\ \mathrm{GeV}^{-2} \tag{4.75}$$

is the Fermi coupling constant. The experimental value of $G_F$, the Weinberg angle
$\theta_w \simeq 28.7^\circ$ ($\sin^2\theta_w \simeq 0.23$), determined in the neutral current interactions, and
the measured masses


$$M_W \simeq 80.4\ \mathrm{GeV}, \qquad M_Z \simeq 91.2\ \mathrm{GeV},$$


are in very good agreement with the theoretical predictions of the standard
electroweak model. From (4.69), the "weak fine structure constant" $\alpha_w \simeq 1/29$ is
seen to be 4.5 times larger than the fine structure constant. As follows from (4.66)
and (4.68), the expectation value for the Higgs field is


$$\chi_0 = \frac{2 M_W \sin\theta_w}{e} \simeq 250\ \mathrm{GeV}. \tag{4.76}$$


However, since the quartic coupling constant $\lambda$ can be arbitrary, the Higgs mass,
given in (4.65), is not predicted. Higgs particles have not yet been discovered.
The experimental lower bound on their masses is $m_H > 114$ GeV. Requiring
self-consistency of the theory, namely, the validity of the perturbative expansion
in $\lambda$, one could expect that $\lambda < 1$ and hence $m_H$ cannot greatly exceed 350 GeV.
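
The numbers quoted above can be cross-checked with a few lines of arithmetic. The following is only an illustrative sketch: it takes $\alpha_w \simeq 1/29$, $\sin^2\theta_w \simeq 0.23$ and $M_W \simeq 80.4$ GeV from the text as inputs and evaluates the tree-level relations (4.66), (4.75) and (4.76). The function and variable names are our own and do not appear in the text.

```python
import math

def electroweak_tree_level(alpha_w=1.0 / 29.0, sin2_theta_w=0.23, m_w=80.4):
    """Tree-level relations (4.66), (4.75), (4.76); masses in GeV."""
    g = math.sqrt(4.0 * math.pi * alpha_w)             # alpha_w = g^2 / (4 pi), eq. (4.69)
    cos_theta_w = math.sqrt(1.0 - sin2_theta_w)
    m_z = m_w / cos_theta_w                            # M_W = M_Z cos(theta_w), eq. (4.66)
    chi_0 = 2.0 * m_w / g                              # chi_0 = 2 M_W sin(theta_w) / e with e = g sin(theta_w)
    g_fermi = g**2 / (4.0 * math.sqrt(2.0) * m_w**2)   # eq. (4.75), in GeV^-2
    return {"g": g, "M_Z": m_z, "chi_0": chi_0, "G_F": g_fermi}

result = electroweak_tree_level()
for name, value in result.items():
    print(f"{name} = {value:.4g}")
```

With these rounded inputs the sketch gives $M_Z$, $\chi_0$ and $G_F$ within a few per cent of the values quoted in the text; the residual differences reflect the rounding of $\alpha_w$ and $\sin^2\theta_w$, not a refinement of the model.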


Problem 4.14 Given that the electric charges of $u$ and $d$ quarks equal $+2/3$ and
$-1/3$ respectively, determine their hypercharges. Derive the Lagrangian for the
first quark generation. (Note that both $u_R$ and $d_R$ are present in the Lagrangian as
SU(2) singlets.) Draw the vertices describing the quark weak interactions.


The neutron decay can be interpreted as the underlying quark process: $d \to u + W^-$.
As a result the neutron, which is a bound state of three quarks $udd$, is
converted into the proton consisting of $uud$ quarks. Because quarks always appear
in bound systems, the calculation of the weak interactions of hadrons is more 
complicated and suffers from various uncertainties. 


4.3.5 Fermion masses 


Until now we have treated all fermions as massless. This is obviously in disagree- 
ment with experiment. However, fermion masses, introduced by hand, would spoil 
gauge invariance and renormalizability. Therefore, the only way to generate them 
is again to use the Higgs mechanism.

The gauge-invariant Yukawa coupling between scalars and fermions for the first
lepton generation is


$$\mathcal{L}_Y = -f_e\left(\bar\psi_L \varphi\, e_R + \bar e_R \varphi^\dagger \psi_L\right), \tag{4.77}$$


where $f_e$ is the dimensionless Yukawa coupling constant. This term is obviously
SU(2)-invariant and, if the hypercharges satisfy the condition


$$Y_L = Y_R + Y_\varphi, \tag{4.78}$$


it is also invariant with respect to U(1) transformations. Substituting $\varphi = \chi\zeta\varphi_0$ (see
(4.47)) in (4.77), we can rewrite the Yukawa coupling in terms of SU(2)-invariant
variables as


$$\mathcal{L}_Y = -f_e \chi\,\bar\Psi_L \varphi_0\, e_R + \text{h.c.} = -f_e \chi\,(\bar e_L e_R + \bar e_R e_L) = -f_e \chi\,\bar e e, \tag{4.79}$$


where h.c. denotes the Hermitian conjugated term. To write the last equality we
have used a well known relation from the theory of Dirac spinors. If the scalar field
takes a nonzero expectation value, so that $\chi = \chi_0 + \phi$, the electron acquires the
mass


$$m_e = f_e \chi_0. \tag{4.80}$$


With $\chi_0$ given in (4.76) and $f_e \simeq 2 \times 10^{-6}$, we get the correct value for the electron
mass. The appearance of such a small coupling constant has no natural explanation
in the electroweak theory, where $f_e$ is a free parameter. The term $f_e \phi\,\bar e e$ describes
the interaction of Higgs particles with electrons. Note that the particular form of the
Yukawa coupling (4.77) gives mass only to the lower component of the doublet.
The neutrino remains massless.

For quarks some complications arise. First of all, both components of the doublets
should acquire masses. Second, to explain flavor nonconservation in the weak
interactions, we have to assume that the lower components of the SU(2) doublets are
superpositions of the lower quark flavors and hence are not quark mass eigenstates.
This suggests we should simultaneously consider all three quark generations. Let
us denote the upper and lower components of the SU(2) gauge-invariant quark
doublets by


$$u^i \equiv (u, c, t)$$

and

$$d'^i \equiv (d', s', b')$$

respectively, where $i = 1, 2, 3$ is the generation index. The lower components are
linear superpositions of the appropriate flavors,


$$d'^i = V^i_j\, d^j, \tag{4.81}$$


where $V^i_j$ is the unitary $3 \times 3$ Kobayashi-Maskawa matrix. The general quark
Yukawa term can then be written as


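$$\mathcal{L}_Y = -f^d_{ij}\,\chi\,\bar Q^i_L \varphi_0\, d'^j_R - f^u_{ij}\,\chi\,\bar Q^i_L \varphi_1\, u^j_R + \text{h.c.}, \tag{4.82}$$


where


$$Q^i_L = \begin{pmatrix} u^i \\ d'^i \end{pmatrix}_L, \qquad \varphi_1 = \begin{pmatrix} 1 \\ 0 \end{pmatrix}. \tag{4.83}$$


The second term on the right hand side in (4.82) is also gauge-invariant and generates
the masses of the upper components of the doublets. Expression (4.82), rewritten
in terms of the original flavors, gives the mass term


$$\mathcal{L}_m = -\left(V^{*i}_m f^d_{ij} V^j_k\right)\chi\,\bar d^m_L d^k_R - f^u_{ij}\,\chi\,\bar u^i_L u^j_R + \text{h.c.} \tag{4.84}$$

Taking the matrices $f^d_{ij}$ and $f^u_{ij}$ such that


$$\left(V^{*i}_m f^d_{ij} V^j_k\right)\chi_0 = m^d_m\,\delta_{mk} \tag{4.85}$$

and

$$f^u_{ij}\,\chi_0 = m^u_i\,\delta_{ij}, \tag{4.86}$$


we get the usual quark mass terms. The Yukawa coupling constant is largest for the
top quark, $f_t \simeq 0.7$ ($m_t \simeq 170$ GeV). For the other quarks it is more than 10 times
smaller.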
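
As a quick numerical illustration of the relation "mass = Yukawa coupling times $\chi_0$" used in (4.80) and (4.86), the sketch below inverts it for the electron and the top quark. It assumes $\chi_0 \simeq 250$ GeV from (4.76) and the top mass of about 170 GeV quoted above; the electron mass of 0.511 MeV is the standard value, and the helper name `yukawa_coupling` is hypothetical.

```python
CHI_0_GEV = 250.0  # Higgs expectation value, eq. (4.76)

def yukawa_coupling(mass_gev, chi_0=CHI_0_GEV):
    """Invert m = f * chi_0, cf. (4.80) and (4.86); mass in GeV."""
    return mass_gev / chi_0

f_electron = yukawa_coupling(0.511e-3)  # electron mass, 0.511 MeV
f_top = yukawa_coupling(170.0)          # top-quark mass quoted in the text
print(f"f_e ~ {f_electron:.2e}, f_t ~ {f_top:.2f}")
```

The six orders of magnitude separating the two couplings are an input to the electroweak theory, not something it explains.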

Neutrino masses, which — according to measurements — are different from zero, 
can be generated in an almost identical manner. In this case, the neutrino flavors 
naturally mix in a similar way to the quark generations and this leads to neutrino 
oscillations. 


4.3.6 CP violation 


The parity operation P corresponds to reflection $(t, x, y, z) \to (t, -x, -y, -z)$ and
converts left-handed particles into right-handed particles without changing their
other properties. Charge conjugation C replaces particles by antiparticles without
changing handedness. For instance, the C operation converts a left-handed electron
to a left-handed positron. No chiral gauge theory is invariant with respect to
P and C operations applied separately, and in the Standard Model these symmetries
are violated in the maximal possible way.


Fig. 4.8.


It is obvious that (4.40) is not invariant with respect to the replacement $L \leftrightarrow R$
because the parity operation converts left-handed neutrinos into nonexistent
right-handed neutrinos. Similarly, charge conjugation converts left-handed neutrinos into
nonexistent left-handed antineutrinos. However, the combined operation CP, which
interchanges left-handed particles with right-handed antiparticles, seems to be a
symmetry of the electroweak theory. In fact, CP is a symmetry of the Lagrangian
without quarks. As an example, let us find out what happens to the charged weak
interaction coupling term in (4.74),


$$-\frac{g}{\sqrt{2}}\left((\bar\nu_L \gamma^\mu e_L)\,W^+_\mu + (\bar e_L \gamma^\mu \nu_L)\,W^-_\mu\right), \tag{4.87}$$


under CP transformations. The first term here corresponds to the vertex in
Figure 4.8 and can be interpreted as describing left-handed electron and
right-handed antineutrino "annihilation" with the emission of a $W^-$ boson. Recall that
an arrow entering a vertex corresponds to a wave function $\psi$ while an outgoing line
corresponds to the conjugated function $\bar\psi$. If the arrow coincides with the direction
of time, then the corresponding line describes the particle; otherwise it describes
the antiparticle. Hence the $W^+$ boson line entering the vertex in Figure 4.8
corresponds to its antiparticle, that is, the $W^-$ boson. The wave function $e_L$ describes the
left-handed electron $e^-_L$, and $\bar\nu_L$ corresponds to the right-handed antineutrino $\bar\nu_R$.
Under charge conjugation C all arrows on the diagram are reversed (Figure 4.9).
The right-handed antineutrino goes to the right-handed neutrino, $\bar\nu_R \to \nu_R$, and the
left-handed electron converts to the left-handed positron, $e^-_L \to e^+_L \Leftrightarrow \bar e_R$. Thus


$$-\frac{g}{\sqrt{2}}\,(\bar\nu_L \gamma^\mu e_L)\,W^+_\mu \;\xrightarrow{\;C\;}\; -\frac{g}{\sqrt{2}}\,(\bar e_R \gamma^\mu \nu_R)\,W^-_\mu. \tag{4.88}$$

After applying the P operation, this term coincides with the second term in
Lagrangian (4.87). Likewise, the second term in (4.87) converts to the first one.
Therefore expression (4.87) is CP-invariant. The reader can verify that the other
terms in (4.74) are also CP-invariant.


Fig. 4.9.


The term describing the charged weak interactions for quarks can be written in the
same form as (4.87):


$$-\frac{g}{\sqrt{2}}\left((\bar u^i_L \gamma^\mu d'^i_L)\,W^+_\mu + (\bar d'^i_L \gamma^\mu u^i_L)\,W^-_\mu\right), \tag{4.89}$$

or, after rewriting it in terms of quark flavors, as

$$-\frac{g}{\sqrt{2}}\left(V^i_j\,(\bar u^i_L \gamma^\mu d^j_L)\,W^+_\mu + (V^i_j)^*\,(\bar d^j_L \gamma^\mu u^i_L)\,W^-_\mu\right). \tag{4.90}$$

Under CP transformation, the first term becomes

$$V^i_j\,(\bar u^i_L \gamma^\mu d^j_L)\,W^+_\mu \to V^i_j\,(\bar d^j_L \gamma^\mu u^i_L)\,W^-_\mu, \tag{4.91}$$


and coincides with the second term only if $V^i_j = (V^i_j)^*$. In other words, (4.90) is
CP-invariant only if the Kobayashi-Maskawa matrix is real-valued; otherwise CP
is violated. An arbitrary $3 \times 3$ unitary matrix


$$V^i_j = \left\| r_{ij} \exp(i\theta_{ij}) \right\|$$


is characterized by three independent real numbers $r$ and by six independent phases
$\theta$. The quark Lagrangian is invariant with respect to global quark rotations:
$q_i \to \exp(i\alpha_i)\, q_i$. Using the six independent parameters for six quarks, we can eliminate
five $\theta$ phases as having no physical meaning. One phase is left over, however,
because bilinear quark combinations are insensitive to the overall phase of the
quark rotation. Because of this one remaining phase factor the Kobayashi-Maskawa
matrix will generally have complex elements and therefore one can expect CP
violation. This CP violation is due to the complex-valued coupling constant in the
charged weak interaction term.
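
The counting in the preceding paragraph can be written as a small function. This is a sketch under the standard assumption that an $n \times n$ unitary matrix has $n^2$ real parameters, of which $n(n-1)/2$ are the rotation angles of a real orthogonal matrix; the function name and the return convention are our own.

```python
def mixing_parameters(n_generations):
    """Count parameters of an n x n unitary quark-mixing matrix.

    Returns (real angles, phases before rephasing, physical phases).
    """
    n = n_generations
    total = n * n                   # real parameters of a unitary n x n matrix
    angles = n * (n - 1) // 2       # parameters of a real orthogonal matrix
    phases = total - angles         # remaining parameters are phases
    removable = 2 * n - 1           # 2n quark rephasings minus the overall phase
    return angles, phases, phases - removable

print(mixing_parameters(3))
```

For three generations this reproduces the numbers in the text: three real parameters, six phases, five of them removable, and one physical CP-violating phase.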


Problem 4.15 Could we expect CP violation in a model with only two quark
generations, where the quark mixing is entirely characterized by the Cabibbo angle?


The violation of CP symmetry was first observed in 1964 in kaon $K^0(d\bar s)$ decay
and then, in 2001, in the $B^0(d\bar b)$ meson system. There is strong evidence that the
Kobayashi-Maskawa mechanism is responsible for this CP violation.

As we will see, CP violation plays a very important role in baryogenesis,
ensuring the possibility of different decay rates for particles and their antiparticles in
particular decay channels. If this were not the case, generation of baryon asymmetry
would be impossible.

Note that if we accompany the CP transformation by time reversal T ($t \to -t$),
which reverses the direction of arrows on the diagrams and changes the handedness,
(4.90) remains invariant. The Lagrangian of the Standard Model is CPT-invariant.
This invariance guarantees that the total decay rates, which include all decay
channels, are the same for particles and their antiparticles.


4.4 “Symmetry restoration” and phase transitions 


A classical scalar field $\chi$ interacts with gauge fields which influence its behavior.
In the early universe this influence can be described using an effective potential.
At very high temperatures the effective potential has only one minimum, at $\chi = 0$,
and the homogeneous component of $\chi$ disappears. As a result all fermions and
intermediate bosons become massless and one says that the symmetry is restored.
In fact, as we have pointed out, the gauge symmetry is never broken by the Higgs
mechanism. Nevertheless, in deference to the commonly used terminology we use
the term symmetry restoration to designate the disappearance of the homogeneous
scalar field.

As the universe expands the temperature decreases. Below a critical temperature
the effective potential acquires an energetically favorable local minimum, at $\chi(T) \neq 0$,
and the transition to this state becomes possible. Depending on the parameters
of the theory this can be either a phase transition or a simple cross-over.

In this section we investigate symmetry restoration and phase transitions in gauge 
theories. 


4.4.1 Effective potential 


To introduce the idea of the effective potential we first consider a simple model
describing a self-interacting real scalar field, which satisfies the equation


$$\chi^{;\alpha}{}_{;\alpha} + V'(\chi) = 0, \tag{4.92}$$


where $V'(\chi) \equiv \partial V/\partial\chi$. The field $\chi$ can always be decomposed into homogeneous
and inhomogeneous components:


$$\chi(t, \mathbf{x}) = \bar\chi(t) + \phi(t, \mathbf{x}), \tag{4.93}$$


so that the spatial average of $\phi(\mathbf{x}, t)$ is equal to zero. Substituting (4.93) into (4.92),
expanding the potential $V$ in powers of $\phi$, and averaging over space, we obtain the
following equation for $\bar\chi(t)$:

$$\bar\chi^{;\alpha}{}_{;\alpha} + V'(\bar\chi) + \frac{1}{2}\,V'''(\bar\chi)\,\langle\phi^2\rangle = 0, \tag{4.94}$$


where the higher-order terms $\sim \langle\phi^3\rangle$, etc. have been neglected. In quantum field
theory this corresponds to the so-called one-loop approximation. We will now show
that in a hot universe the last term in (4.94) can be combined with $V'(\bar\chi)$ and
rewritten as the derivative of an effective potential $V_{\rm eff}(\bar\chi, T)$. For this purpose, let
us calculate the average $\langle\phi^2\rangle$.
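
The step leading to (4.94) can be spelled out explicitly. The lines below are a sketch in the notation used above, with the spatial average written as $\langle\ldots\rangle$ and the defining property $\langle\phi\rangle = 0$ of the decomposition (4.93):

```latex
V'(\bar\chi + \phi) = V'(\bar\chi) + V''(\bar\chi)\,\phi
    + \tfrac{1}{2}\,V'''(\bar\chi)\,\phi^2 + O(\phi^3),
\qquad \langle \phi \rangle = 0
\;\Longrightarrow\;
\langle V'(\chi) \rangle \simeq V'(\bar\chi)
    + \tfrac{1}{2}\,V'''(\bar\chi)\,\langle \phi^2 \rangle .
```

The term linear in $\phi$ drops out of the averaged equation, so the fluctuations first enter through $\langle\phi^2\rangle$.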


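Scalar field quantization. In the lowest (linear) order, the inhomogeneous modes $\phi$
obey the equation


$$\phi^{;\alpha}{}_{;\alpha} + V''(\bar\chi)\,\phi = 0, \tag{4.95}$$

obtained by linearizing (4.92). Assuming that the mass

$$m_\phi^2(\bar\chi) \equiv V''(\bar\chi) \geq 0$$


does not depend on time, and neglecting the expansion of the universe (this is a
good approximation for our purposes), the solution of (4.95) is


$$\phi(\mathbf{x}, t) = \int \frac{d^3k}{(2\pi)^{3/2}}\,\frac{1}{\sqrt{2\omega_k}} \left( e^{-i\omega_k t + i\mathbf{k}\mathbf{x}}\,a^-_{\mathbf{k}} + e^{i\omega_k t - i\mathbf{k}\mathbf{x}}\,a^+_{\mathbf{k}} \right), \tag{4.96}$$


where

$$\omega_k = \sqrt{k^2 + V''(\bar\chi)} = \sqrt{k^2 + m_\phi^2},$$


$k \equiv |\mathbf{k}|$, and $a^-_{\mathbf{k}}$, $a^+_{\mathbf{k}} = (a^-_{\mathbf{k}})^*$ are the integration constants. Our task is to calculate
both quantum and thermal contributions to $\langle\phi^2\rangle$.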

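In quantum theory, the field $\phi(\mathbf{x}, t) \equiv \phi_{\mathbf{x}}(t)$ becomes a "position" operator $\hat\phi_{\mathbf{x}}(t)$
and the spatial coordinates $\mathbf{x}$ can be considered simply as enumerating the degrees of
freedom of the physical system. That is, at each point in space, we have one degree of
freedom, a field strength, which plays the role of position in a configuration space.
Hence, a quantum field is a quantum mechanical system with an infinite number of
degrees of freedom. As in usual quantum mechanics, the position operators $\hat\phi_{\mathbf{x}}(t)$
and their conjugated momenta


$$\hat\pi_{\mathbf{y}} \equiv \partial\mathcal{L}/\partial\dot\phi_{\mathbf{y}} = \partial\hat\phi_{\mathbf{y}}/\partial t$$

should satisfy the Heisenberg commutation relations:


$$\left[\hat\phi_{\mathbf{x}}(t), \hat\pi_{\mathbf{y}}(t)\right] = \left[\hat\phi_{\mathbf{x}}(t), \frac{\partial\hat\phi_{\mathbf{y}}(t)}{\partial t}\right] = i\delta(\mathbf{x} - \mathbf{y}), \tag{4.97}$$

where


$$\left[\hat\phi_{\mathbf{x}}(t), \hat\pi_{\mathbf{y}}(t)\right] \equiv \hat\phi_{\mathbf{x}}(t)\,\hat\pi_{\mathbf{y}}(t) - \hat\pi_{\mathbf{y}}(t)\,\hat\phi_{\mathbf{x}}(t)$$


and Planck's constant is set to unity. The field operator $\hat\phi_{\mathbf{x}}(t)$ obeys (4.95) and its
solution is given in (4.96), but now the integration constants should be considered
as time-independent operators $\hat a^-_{\mathbf{k}}$, $\hat a^+_{\mathbf{k}}$. Substituting (4.96) in (4.97), we find that
the operators $\hat a^+_{\mathbf{k}}$, $\hat a^-_{\mathbf{k}}$ satisfy the commutation relations


$$\left[\hat a^-_{\mathbf{k}}, \hat a^+_{\mathbf{k}'}\right] = \delta(\mathbf{k} - \mathbf{k}'), \qquad \left[\hat a^-_{\mathbf{k}}, \hat a^-_{\mathbf{k}'}\right] = \left[\hat a^+_{\mathbf{k}}, \hat a^+_{\mathbf{k}'}\right] = 0. \tag{4.98}$$


Except for the appearance of the $\delta$ function, these behave like the creation and
annihilation operators of harmonic oscillators. The Hilbert space in which these
operators act then resembles the Hilbert space of a set of harmonic oscillators. The
vacuum state $|0\rangle$ is defined via


$$\hat a^-_{\mathbf{k}}\,|0\rangle = 0 \tag{4.99}$$


for all $\mathbf{k}$, and corresponds to the minimal energy state. The vectors


$$|n_{\mathbf{k}}\rangle \equiv \frac{(\hat a^+_{\mathbf{k}})^{n_{\mathbf{k}}}}{\sqrt{n_{\mathbf{k}}!}}\,|0\rangle \tag{4.100}$$


are interpreted as describing $n_{\mathbf{k}}$ particles per single quantum state characterized by
the wave vector $\mathbf{k}$.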





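Problem 4.16 The operator $\hat N_{\mathbf{k}} \equiv \hat a^+_{\mathbf{k}} \hat a^-_{\mathbf{k}}$ corresponds to the total number of
particles with wave vector $\mathbf{k}$. Using commutation relations (4.98), verify that
$\hat N_{\mathbf{k}}\,|n_{\mathbf{k}}\rangle = \delta(0)\,n_{\mathbf{k}}\,|n_{\mathbf{k}}\rangle$. The appearance of $\delta(0)$ can easily be understood if we
take into account that the total number of quantum states with a given momentum
is proportional to the volume. Because we consider an infinite volume, the factor
$\delta(0)$ simply reflects this infinity. The number of particles per unit volume is finite
and equal to the occupation number $n_{\mathbf{k}}$.
Verify that

$$\langle \hat a^+_{\mathbf{k}} \hat a^-_{\mathbf{k}'} \rangle \equiv \frac{\langle n_{\mathbf{k}} |\, \hat a^+_{\mathbf{k}} \hat a^-_{\mathbf{k}'} \,| n_{\mathbf{k}} \rangle}{\langle n_{\mathbf{k}} | n_{\mathbf{k}} \rangle} = n_{\mathbf{k}}\,\delta(\mathbf{k} - \mathbf{k}'), \qquad \langle \hat a^-_{\mathbf{k}} \hat a^-_{\mathbf{k}'} \rangle = \langle \hat a^+_{\mathbf{k}} \hat a^+_{\mathbf{k}'} \rangle = 0. \tag{4.101}$$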








Now we can proceed with the calculation of $\langle\phi^2\rangle$. Let us take a quantum state
with occupation numbers $n_k$ in every mode $\mathbf{k}$. In a homogeneous isotropic universe
the spatial average can be replaced by the quantum average. Squaring (4.96) and
using the results (4.101), we find that


$$\langle\phi^2(\mathbf{x})\rangle = \frac{1}{2\pi^2} \int \frac{k^2}{\sqrt{k^2 + m_\phi^2}} \left(\frac{1}{2} + n_k\right) dk. \tag{4.102}$$


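Vacuum contribution to $V_{\rm eff}$. First we consider only vacuum fluctuations and set
$n_k = 0$. The integral in (4.102) is divergent as $k \to \infty$. To reveal the nature of
the divergence we regularize this integral by introducing the cut-off scale $k_c = M$.
Taking into account that $m_\phi^2(\bar\chi) = V''$, we can rewrite the third term in (4.94) as


$$\frac{1}{2}\,V'''\,\langle\phi^2\rangle = \frac{1}{8\pi^2}\,\frac{\partial m_\phi^2(\bar\chi)}{\partial\bar\chi} \int_0^M \frac{k^2\,dk}{\sqrt{k^2 + m_\phi^2(\bar\chi)}} = \frac{\partial V_\phi}{\partial\bar\chi}, \tag{4.103}$$


where


$$V_\phi = \frac{1}{4\pi^2} \int_0^M \sqrt{k^2 + m_\phi^2}\;k^2\,dk \equiv \frac{I(m_\phi(\bar\chi))}{4\pi^2} \tag{4.104}$$


is simply equal to the energy density of the vacuum fluctuations.
Using (4.103), (4.94) becomes


$$\bar\chi^{;\alpha}{}_{;\alpha} + V'_{\rm eff}(\bar\chi) = 0, \tag{4.105}$$


where $V_{\rm eff}(\bar\chi) = V + V_\phi$ is the one-loop effective potential. The integral $I$ which
enters (4.104) can be calculated exactly:


$$I(m) = \frac{1}{8}\left[ M\,(2M^2 + m^2)\sqrt{M^2 + m^2} + m^4 \ln\frac{m}{M + \sqrt{M^2 + m^2}} \right].$$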
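
The closed form for $I(m)$ is easy to check numerically. The sketch below compares it with a direct Simpson-rule evaluation of $\int_0^M \sqrt{k^2 + m^2}\,k^2\,dk$ and with its leading large-$M$ behavior, $M^4/4 + m^2M^2/4 + m^4/32 + (m^4/8)\ln(m/2M)$, which follows from expanding the closed form. The function names, the test values $m = 1$, $M = 50$ and the number of integration steps are illustrative choices, not part of the text.

```python
import math

def cutoff_integral_closed_form(m, cutoff):
    """Closed form of I(m) = int_0^M sqrt(k^2 + m^2) k^2 dk."""
    root = math.sqrt(cutoff**2 + m**2)
    return (cutoff * (2.0 * cutoff**2 + m**2) * root
            + m**4 * math.log(m / (cutoff + root))) / 8.0

def cutoff_integral_numeric(m, cutoff, steps=20000):
    """Composite Simpson rule for the same integral (steps must be even)."""
    h = cutoff / steps
    f = lambda k: math.sqrt(k * k + m * m) * k * k
    total = f(0.0) + f(cutoff)
    for i in range(1, steps):
        total += (4.0 if i % 2 else 2.0) * f(i * h)
    return total * h / 3.0

def cutoff_integral_asymptotic(m, cutoff):
    """Leading terms of I(m) for M >> m, obtained by expanding the closed form."""
    return (cutoff**4 / 4.0 + m**2 * cutoff**2 / 4.0 + m**4 / 32.0
            + m**4 / 8.0 * math.log(m / (2.0 * cutoff)))

m, M = 1.0, 50.0
exact = cutoff_integral_closed_form(m, M)
print(exact, cutoff_integral_numeric(m, M), cutoff_integral_asymptotic(m, M))
```

The quartic and quadratic powers of $M$ in the asymptotic form, together with the logarithm, are the divergent pieces that the next step absorbs into the bare constants.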


Taking the limit $M \to \infty$, we obtain the following expression for the effective
potential:


$$V_{\rm eff} = V + \frac{m_\phi^4(\bar\chi)}{64\pi^2} \ln\frac{m_\phi^2(\bar\chi)}{\mu^2} + V_\infty, \tag{4.106}$$


where the divergent term


$$V_\infty = \frac{M^4}{16\pi^2} + \frac{m_\phi^2}{16\pi^2}\,M^2 - \frac{m_\phi^4}{32\pi^2} \ln\frac{2M}{e^{1/4}\mu} + O\!\left(\frac{1}{M^2}\right)$$


can be absorbed by a redefinition of constants in the original potential. For instance,
in the case of the renormalizable quartic potential


$$V = \frac{\lambda_0}{4}\,\chi^4 + \frac{m_0^2}{2}\,\chi^2 + \Lambda_0, \tag{4.107}$$


the mass is

$$m_\phi^2 = V'' = 3\lambda_0\bar\chi^2 + m_0^2.$$


The divergent terms are then proportional to $\bar\chi^0$, $\bar\chi^2$ and $\bar\chi^4$, and can be combined
with appropriate terms in $V(\bar\chi)$. As a result, the bare constants $\lambda_0$, $m_0^2$ and $\Lambda_0$
are replaced by the finite renormalized constants $\lambda_R$, $m_R^2$ and $\Lambda_R$, measured in
experiments. The term $V_\infty$, therefore, can be omitted in (4.106).


Problem 4.17 Find the explicit relations between the bare and renormalized pa- 
rameters. 


The potential (4.106) looks peculiar because it seems to depend explicitly on an
arbitrary scale $\mu$. However, it is easy to see that the change in $\mu$ induces a term
proportional to $m_\phi^4(\bar\chi)$, which, for a renormalizable theory, has the same structure
as the original potential $V$ and hence leads to finite renormalization of constants.
These constants, therefore, become scale-dependent (running), reflecting the
renormalization group properties of the quantum field theory. The physics remains the
same; it is only our way of interpreting the constants that changes. For pure $\lambda\chi^4/4$
theory with $m_\phi^2(\bar\chi) = 3\lambda\bar\chi^2$, we have


$$V_{\rm eff} = \frac{\lambda}{4}\,\bar\chi^4 + \frac{9\lambda^2}{32\pi^2}\,\bar\chi^4 \ln\frac{\bar\chi}{\chi_0}, \tag{4.108}$$


where $\lambda = \lambda(\chi_0)$ and $\chi_0$ is some normalization scale. The logarithmic corrections to
the potential are proportional to $\lambda^2$ and become comparable to the leading $\lambda\bar\chi^4$ term
only when $\lambda \ln(\bar\chi/\chi_0) \sim O(1)$. At these large values of $\bar\chi$, however, the higher-loop
contributions we have neglected thus far become crucial.


Problem 4.18 Requiring that potential (4.108) should not depend on $\chi_0$, derive
the renormalization group equation for $\lambda(\bar\chi)$. Solve this equation, keeping only the
leading term in the $\beta$ function, and verify that $\lambda(\bar\chi)$ blows up when
$\lambda(\chi_0) \ln(\bar\chi/\chi_0) \sim O(1)$.


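Thermal contribution to $V_{\rm eff}$. In a hot universe the field $\phi$ is no longer in its vacuum
state. The occupation numbers $n_k$ are given by the Bose-Einstein formula (3.20),
where $\epsilon = \omega_k = \sqrt{k^2 + m_\phi^2}$ and the chemical potential can be neglected. Substituting
the Bose-Einstein distribution in (4.102), we obtain the following expression
for the thermal contribution to $\langle\phi^2\rangle$:


$$\langle\phi^2\rangle_T = \frac{1}{2\pi^2} \int_0^\infty \frac{k^2\,dk}{\omega_k \left(e^{\omega_k/T} - 1\right)} = \frac{T^2}{4\pi^2}\,J_-^{(1)}\!\left(\frac{m_\phi(\bar\chi)}{T}, 0\right). \tag{4.109}$$

In deriving this formula, we have changed the integration variable $k \to \omega_k/T$ to
express the result through the integral $J_-^{(1)}$ defined in (3.34). For thermal fluctuations
the third term on the left hand side in (4.94) can be rewritten as


$$\frac{1}{2}\,V'''\,\langle\phi^2\rangle_T = \frac{T^2}{8\pi^2}\,\frac{\partial m_\phi^2}{\partial\bar\chi}\,J_-^{(1)}\!\left(\frac{m_\phi}{T}, 0\right) \equiv \frac{\partial V_\phi^T}{\partial\bar\chi}, \tag{4.110}$$

where

$$V_\phi^T = \frac{T^4}{4\pi^2} \int_0^{m_\phi/T} \alpha\,J_-^{(1)}(\alpha, 0)\,d\alpha \equiv \frac{T^4}{4\pi^2}\,F_-\!\left(\frac{m_\phi}{T}\right) \tag{4.111}$$


is the temperature-dependent contribution of scalar particles to $V_{\rm eff}$.
The final result, which includes both quantum and thermal contributions, is


$$V_{\rm eff} = V + \frac{m_\phi^4(\bar\chi)}{64\pi^2} \ln\frac{m_\phi^2(\bar\chi)}{\mu^2} + \frac{T^4}{4\pi^2}\,F_-\!\left(\frac{m_\phi(\bar\chi)}{T}\right), \tag{4.112}$$


where $m_\phi^2(\bar\chi) = V''(\bar\chi)$.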
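
The momentum integral for the thermal part of $\langle\phi^2\rangle$, that is (4.102) with the Bose-Einstein occupation numbers and without the vacuum term $1/2$, can be evaluated directly. In the massless limit it is elementary, since $\int_0^\infty x\,dx/(e^x - 1) = \pi^2/6$, which gives $\langle\phi^2\rangle_T = T^2/12$. The sketch below checks this with a midpoint rule; the cut-off `x_max`, the step count and the function name are our own assumptions.

```python
import math

def thermal_phi2(m, T, x_max=60.0, steps=60000):
    """Thermal <phi^2>: (1/2 pi^2) int k^2 dk / (omega (exp(omega/T) - 1)).

    Uses x = k / T and a midpoint rule on [0, x_max]; x_max is assumed large
    enough that the exponentially suppressed tail can be dropped.
    """
    a = m / T
    h = x_max / steps
    total = 0.0
    for i in range(steps):
        x = (i + 0.5) * h
        w = math.sqrt(x * x + a * a)            # omega / T
        total += x * x / (w * math.expm1(w))    # Bose-Einstein weight
    return T * T * total * h / (2.0 * math.pi**2)

T = 1.0
massless = thermal_phi2(0.0, T)
massive = thermal_phi2(5.0, T)
print(massless, T**2 / 12.0, massive)
```

A nonzero mass suppresses the thermal fluctuations, which is why the temperature-dependent part of the potential matters mainly where $m_\phi(\bar\chi)$ is not much larger than $T$.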


4.4.2 U(1) model 


Now we calculate the effective potential in a U(1) gauge model. The equation for
the scalar field immediately follows from (4.44),


$$\chi^{;\alpha}{}_{;\alpha} + V'(\chi) - e^2 \chi\,G^\mu G_\mu = 0. \tag{4.113}$$


In this case the calculation of the contribution of the scalar particles to $V_{\rm eff}$ is a bit
more complicated because the field $\chi$ is unambiguously defined only for $\chi > 0$.
To avoid the complications we consider, therefore, only the most interesting case
when the contribution of the vector particles dominates that of the scalar particles.
For the quartic potential in (4.107) this means $e^2 \gg \lambda$, that is, the mass of the
gauge boson $m_G(\bar\chi) = e\bar\chi$ is much bigger than the mass of the Higgs particle. Note
that the calculation of the one-loop contribution of the vector particles to $V_{\rm eff}$ can
still be trusted, even when it becomes comparable to the $\lambda\chi^4$ term. Neglecting the
contribution of field $\phi$, we find that the homogeneous component of the scalar field
satisfies the equation


$$\bar\chi^{;\alpha}{}_{;\alpha} + V'(\bar\chi) - e^2 \bar\chi\,\langle G^\mu G_\mu \rangle = 0. \tag{4.114}$$


The term $\langle G^\mu G_\mu \rangle$ can be calculated similarly to $\langle\phi^2\rangle$, and it is easy to show
that


$$-e^2 \bar\chi\,\langle G^\mu G_\mu \rangle = \frac{3}{4\pi^2}\,\frac{\partial I(m_G(\bar\chi))}{\partial\bar\chi} = \frac{\partial V_G}{\partial\bar\chi}, \tag{4.115}$$


where the integral $I$ is defined in (4.104) and $V_G$ is the energy density of the vacuum
fluctuations of the vector field with mass


$$m_G(\bar\chi) = e\bar\chi.$$


The factor 3 in (4.115) is due to the fact that the massive vector field has three degrees
of freedom at every point in space. The calculation of the temperature-dependent
contribution of the vector field essentially repeats the calculation for the scalar field
and the final result, which includes both quantum and thermal contributions, is


$$V_{\rm eff}(\bar\chi, T) = V + \frac{3 m_G^4(\bar\chi)}{64\pi^2} \ln\frac{m_G^2(\bar\chi)}{\mu^2} + \frac{3T^4}{4\pi^2}\,F_-\!\left(\frac{m_G(\bar\chi)}{T}\right), \tag{4.116}$$


where $m_G(\bar\chi) = e\bar\chi$ and $F_-$ is defined in (4.111).
At zero temperature the last term vanishes and for the quartic potential in (4.107)
we obtain the following result:


$$V_{\rm eff} = \frac{\lambda_R}{4}\,\bar\chi^4 + \frac{m_R^2}{2}\,\bar\chi^2 + \Lambda_R + \frac{3e^4}{32\pi^2}\,\bar\chi^4 \ln\frac{\bar\chi}{\chi_0}, \tag{4.117}$$


where the renormalized constants $\lambda_R$, $m_R^2$, $\Lambda_R$ can be expressed through
experimentally measurable parameters. The concrete set of these parameters depends on
the normalization conditions used.


Problem 4.19 Assume that the potential $V_{\rm eff}$ has its minimum at some $\chi_0 \neq 0$
and is equal to zero at this minimum, that is, there is no cosmological constant in
the broken symmetry phase. Solving the equations $V_{\rm eff}(\chi_0) = 0$, $V'_{\rm eff}(\chi_0) = 0$ and
$V''_{\rm eff}(\chi_0) = m_H^2$, verify that


$$\lambda_R = \frac{m_H^2}{2\chi_0^2} - \frac{9e^4}{32\pi^2}, \qquad m_R^2 = -\frac{m_H^2}{2} + \frac{3e^4}{16\pi^2}\,\chi_0^2, \qquad \Lambda_R = \frac{\chi_0^2}{4}\left(\frac{m_H^2}{2} - \frac{3e^4\chi_0^2}{32\pi^2}\right). \tag{4.118}$$


Thus we have expressed the renormalized constants in (4.117) in terms of $\chi_0$, the
gauge coupling constant $e$ and the Higgs mass $m_H$.


Given that in the broken symmetry phase

$$M_G \equiv m_G(\chi_0) = e\chi_0, \tag{4.119}$$


we note that if $m_H^2 < 3e^2 M_G^2/8\pi^2$, then $m_R^2 > 0$, and the potential (4.117)
acquires a second local minimum at $\bar\chi = 0$. Moreover, for $m_H^2 < 3e^2 M_G^2/16\pi^2$, this
minimum is even deeper than the minimum at $\chi_0$, because $V_{\rm eff}(\bar\chi = 0) = \Lambda_R < 0$.
Therefore symmetry breaking becomes energetically unfavorable. We will see later
that symmetry is restored in the very early universe. Hence, if the mass of the Higgs
particle did not satisfy the inequality


$$m_H^2 > \frac{3e^2 M_G^2}{16\pi^2}, \tag{4.120}$$


known as the Linde-Weinberg bound, the symmetry would remain unbroken and
the gauge bosons would be massless.
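
The normalization conditions behind (4.118) and the role of the bound (4.120) can be checked numerically. The sketch below builds the zero-temperature potential (4.117) from the constants (4.118), evaluates its value, slope and curvature at $\chi_0$ by finite differences, and looks at the sign of $\Lambda_R = V_{\rm eff}(0)$ on either side of the bound. The parameter values ($e = 1$, $\chi_0 = 1$, $m_H = 0.3$ in units of $\chi_0$), the step size and all function names are illustrative assumptions.

```python
import math

def renormalized_constants(m_h, e, chi_0):
    """Return (lambda_R, m_R^2, Lambda_R) as given in (4.118)."""
    lam_r = m_h**2 / (2.0 * chi_0**2) - 9.0 * e**4 / (32.0 * math.pi**2)
    m2_r = -m_h**2 / 2.0 + 3.0 * e**4 * chi_0**2 / (16.0 * math.pi**2)
    vac_r = chi_0**2 / 4.0 * (m_h**2 / 2.0 - 3.0 * e**4 * chi_0**2 / (32.0 * math.pi**2))
    return lam_r, m2_r, vac_r

def v_eff_zero_t(chi, m_h, e, chi_0):
    """Zero-temperature potential (4.117), valid for chi > 0."""
    lam_r, m2_r, vac_r = renormalized_constants(m_h, e, chi_0)
    log_term = 3.0 * e**4 / (32.0 * math.pi**2) * chi**4 * math.log(chi / chi_0)
    return lam_r / 4.0 * chi**4 + m2_r / 2.0 * chi**2 + vac_r + log_term

def linde_weinberg_mass(e, chi_0):
    """Smallest Higgs mass allowed by (4.120), with M_G = e * chi_0."""
    return math.sqrt(3.0) * e * (e * chi_0) / (4.0 * math.pi)

e, chi_0, m_h = 1.0, 1.0, 0.3      # illustrative values, m_h above the bound
h = 1e-4                           # step for central finite differences
v = lambda chi: v_eff_zero_t(chi, m_h, e, chi_0)
slope = (v(chi_0 + h) - v(chi_0 - h)) / (2.0 * h)
curvature = (v(chi_0 + h) - 2.0 * v(chi_0) + v(chi_0 - h)) / h**2
print(v(chi_0), slope, curvature, m_h**2)
print(linde_weinberg_mass(e, chi_0), renormalized_constants(m_h, e, chi_0)[2])
```

For a Higgs mass below `linde_weinberg_mass(e, chi_0)` the returned $\Lambda_R$ turns negative, which is the statement that the minimum at $\bar\chi = 0$ lies deeper than the one at $\chi_0$.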

Let us consider a special case: $m_R^2 = 0$, or equivalently, $V''_{\rm eff}(\bar\chi = 0) = 0$. Then
it follows from (4.118) that potential (4.117) reduces to


$$V_{\rm eff} = \frac{3e^4}{32\pi^2}\left(\bar\chi^4 \ln\frac{\bar\chi}{\chi_0} - \frac{1}{4}\,\bar\chi^4 + \frac{1}{4}\,\chi_0^4\right), \tag{4.121}$$

which is the Coleman-Weinberg potential. Such potentials may arise in unified
particle theories. They are especially interesting for cosmological applications and
can be used to construct the so-called new inflationary scenario.

Now we derive the asymptotic behavior of the potential in the limit of very high
temperatures. To calculate $F_-$ for large temperatures, $T \gg m_G(\bar\chi)$, one can use
the high-temperature expansion (3.44) for $J_-^{(1)}$. Then, taking into account (4.118),
potential (4.116) reduces to


$$V_{\rm eff}(\bar\chi, T) \simeq \frac{\lambda_T}{4}\,\bar\chi^4 - \frac{e^3}{4\pi}\,T\,\bar\chi^3 + \frac{m_T^2}{2}\,\bar\chi^2 + \Lambda_R, \tag{4.122}$$

where

$$\lambda_T = \frac{m_H^2}{2\chi_0^2} + \frac{3e^4}{16\pi^2} \ln\frac{b\,T^2}{(e\chi_0)^2}, \qquad \ln b = 2\ln 4\pi - 2C \simeq 3.9 \tag{4.123}$$

(with $C \simeq 0.577$ Euler's constant) is the effective coupling constant and

$$m_T^2 = \frac{e^2}{4}\left(T^2 - T_0^2\right), \qquad T_0^2 = \frac{2m_H^2}{e^2} - \frac{3e^2\chi_0^2}{4\pi^2}, \tag{4.124}$$


is the temperature-dependent mass. Formula (4.122) is applicable only if
$m_G = e\bar\chi \ll T$ or, in other words, for $\bar\chi \ll T/e$.

Note that our investigation so far was based on the one-loop approximation.
Higher order corrections can modify the detailed structure of the effective potential
at small $\bar\chi$. In particular, it can be shown that when account is taken of these
corrections, the cubic term in (4.122) should be multiplied by the coefficient 2/3.
This effect, however, goes beyond the scope of our consideration and will be ignored
in what follows.


Problem 4.20 Using (3.47) for $J_-^{(1)}(\alpha, 0)$, find the first few terms in the
low-temperature expansion of effective potential (4.116). (Hint: In this case it is more
convenient to use $m/T$ and $\infty$ as the limits of integration in (4.111). Why?)


Fig. 4.10.


4.4.3 Symmetry restoration at high temperature 


Potential (4.122) is shown in Figure 4.10 for a few different temperatures. For very
large $T$ it has only one minimum at $\bar\chi = 0$; the symmetry is restored and the gauge
bosons and fermions are massless. When the temperature drops below

$$T_1 = \frac{T_0}{\sqrt{1 - (9e^4/16\pi^2\lambda_{T_1})}}, \tag{4.125}$$

the second minimum appears; first it is located at $\bar\chi_1 = 3e^3T_1/8\pi\lambda_{T_1}$, and then it
moves to the right as the temperature drops. The values of $V_{\mathrm{eff}}$ at the two minima
become equal at the critical temperature

$$T_c = \frac{T_0}{\sqrt{1 - (e^4/2\pi^2\lambda_{T_c})}}. \tag{4.126}$$

At this time the second minimum, located at

$$\bar\chi_c = \frac{e^3T_c}{2\pi\lambda_{T_c}}, \tag{4.127}$$

is separated from $\bar\chi = 0$ by a potential barrier of height

$$\Delta V_c = \frac{e^{12}T_c^4}{4(4\pi)^4\lambda_{T_c}^3}, \tag{4.128}$$

with the maximum at $\bar\chi = \bar\chi_c/2$. Note that the coupling constant $\lambda_T$ cannot be taken
as arbitrarily small, because for large $T$ the temperature corrections to it exceed
$e^4$. Hence, at the critical temperature the second minimum, located at $\bar\chi_c < T/e$,
is always within the region of applicability of the high-temperature expansion.
As the temperature drops below $T_0$, $m_T^2$ becomes negative and the minimum at
$\bar\chi = 0$ disappears. Finally, at very low temperatures (see Problem 4.20) the potential
converges to (4.117).
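
As a numerical cross-check of (4.125)-(4.128), the following Python sketch evaluates the characteristic temperatures for potential (4.122). It is only an illustration under stated assumptions: $\lambda_T$ is treated as a fixed number (its logarithmic temperature dependence in (4.123) is ignored), the constant $\Lambda$ is dropped, and the function names and the sample values of $e$, $\lambda_T$ and $T_0$ are ours rather than taken from the text.

```python
import math


def v_high_t(chi, T, e, lam, T0):
    """High-temperature potential (4.122), with the constant Lambda dropped."""
    m_T2 = 0.25 * e**2 * (T**2 - T0**2)  # temperature-dependent mass (4.124)
    cubic = e**3 * T / (4.0 * math.pi)   # coefficient of the chi^3 term
    return 0.25 * lam * chi**4 - cubic * chi**3 + 0.5 * m_T2 * chi**2


def characteristic_temperatures(e, lam, T0):
    """Return T_1 (4.125), T_c (4.126), chi_c (4.127) and the barrier (4.128).

    Assumption: lambda_T is replaced by the constant `lam`.
    """
    T1 = T0 / math.sqrt(1.0 - 9.0 * e**4 / (16.0 * math.pi**2 * lam))
    Tc = T0 / math.sqrt(1.0 - e**4 / (2.0 * math.pi**2 * lam))
    chi_c = e**3 * Tc / (2.0 * math.pi * lam)
    barrier = e**12 * Tc**4 / (4.0 * (4.0 * math.pi) ** 4 * lam**3)
    return T1, Tc, chi_c, barrier


# Hypothetical sample values in arbitrary energy units.
T1, Tc, chi_c, barrier = characteristic_temperatures(e=0.3, lam=0.01, T0=100.0)
print(T1, Tc, chi_c, barrier)
```

With these inputs the two minima are degenerate at $T_c$ (the potential vanishes at both $\bar\chi = 0$ and $\bar\chi_c$), and the value of the potential at $\bar\chi_c/2$ reproduces the barrier height (4.128).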


4.4.4 Phase transitions 


As the temperature drops below the critical value $T_c$, the minimum at $\bar\chi_c \neq 0$
becomes energetically favorable. Therefore, the field $\bar\chi$ can change its value and
evolve to this second minimum. If at this time the two minima are separated by a 
potential barrier, the transition occurs with bubble nucleation. Inside the bubbles 
the scalar field acquires a nonvanishing expectation value. If the bubble nucleation 
rate exceeds the universe’s expansion rate, the bubbles collide and eventually fill all 
space. As a result, gauge bosons and fermions become massive. Such a transition 
is called a first order phase transition. It is very violent and one can expect large 
deviations from thermal equilibrium. 

The other possible scenario takes place if $\bar\chi = 0$ and $\bar\chi_c \neq 0$ are never separated
by a potential barrier. In this case, the field $\bar\chi$ gradually changes its value and the
transition is smooth. It can be either a second order phase transition or simply a
cross-over. As we have pointed out, a second order phase transition is usually
characterized by a continuous change of some symmetry. Because the gauge symmetry
is never broken by the Higgs mechanism, we expect that in gauge theories a smooth 
transition is a cross-over. From the point of view of cosmological scenarios, a cross- 
over is not very different from a second order phase transition and for our purposes 
we simply have to distinguish a violent from a smooth transition. 

Let us now discuss what kind of transition one could expect in U(1) theory. To
answer this question we consider the high-temperature expansion of $V_{\mathrm{eff}}$ given in
(4.122). First of all we note that the barrier is due entirely to the $\bar\chi^3$ term, which
in turn appears because of the nonanalytic term $\propto \sqrt{(m_G/T)^2}$ in (3.44) for $J_-^{(1)}$.
If the mass $m_G$ were zero, this term would be absent. Therefore, to establish the
character of the transition, we need to know when we can trust our calculation of
the $\bar\chi^3$ contribution to $V_{\mathrm{eff}}$. As follows from (4.109), the temperature fluctuations
of the scalar field are about

$$\delta\chi = \sqrt{\langle\phi^2\rangle_T} \simeq T/\sqrt{24}.$$

For $\bar\chi < T/\sqrt{24}$, the vector bosons can no longer be treated as massive particles.
Therefore, for this range, the perturbative consideration of the mass corrections in
$V_{\mathrm{eff}}$ fails and one expects the $\bar\chi^3$ term to be absent.

The following simple criterion provides a sense of when the barrier will be present:
if, at the critical temperature $T_c$, the value of the scalar field at the expected
location of the barrier maximum,

$$\bar\chi_c/2 = e^3T_c/4\pi\lambda_{T_c},$$

exceeds $T_c/\sqrt{24}$, then the barrier really exists. Indeed, in this case the calculation
of the $\bar\chi^3$ term is reliable. Thus, we conclude that if the coupling constant $\lambda$ is small
enough, namely

$$\frac{\sqrt{6}}{2\pi}e^3 > \lambda > \frac{9}{16\pi^2}e^4, \tag{4.129}$$

the maximum of the barrier is located at

$$T_c/e > \bar\chi_c/2 > T_c/\sqrt{24}$$

and the first order phase transition with bubble nucleation should take place. Because
the Higgs mass increases with $\lambda$, this situation can be realized only if the Higgs
particle is not too heavy.
On the other hand, if the coupling constant is large,

$$e^2 > \lambda > \frac{\sqrt{6}}{2\pi}e^3, \tag{4.130}$$

the barrier should be located at

$$\bar\chi_c/2 < T_c/\sqrt{24}.$$

However, for this range of $\bar\chi$, the bosons should be treated as massless particles and
the $\bar\chi^3$ term should be left out of the potential. Thus, we expect that the barrier does
not arise at all and the effective potential changes as shown in Figure 4.11. In this
case the symmetry breaking occurs smoothly via a gradual increase of the mean
value of the scalar field. Therefore, the transition has no dramatic cosmological
consequences. We remind the reader that the contribution of the scalar particles can
be neglected only if the first inequality in (4.130) is fulfilled.

The criteria derived above are no more than rough estimates. However, more 
sophisticated analysis shows that these estimates reproduce the more rigorous re- 
sults rather well. 
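
The rough criteria (4.129) and (4.130) can be summarized in a few lines of Python. This is a sketch only: the function name and the returned labels are our own, and the boundaries are the order-of-magnitude estimates derived above, not sharp thresholds.

```python
import math


def transition_type(e, lam):
    """Classify the U(1) transition using the estimates (4.129) and (4.130)."""
    lower = 9.0 * e**4 / (16.0 * math.pi**2)               # lower bound in (4.129)
    barrier_limit = math.sqrt(6.0) * e**3 / (2.0 * math.pi)
    if lam <= lower:
        return "outside the range of (4.129)"
    if lam < barrier_limit:
        return "first order"                               # barrier present
    if lam < e**2:
        return "smooth"                                    # (4.130): no barrier
    return "scalar loops not negligible"                   # lambda > e^2


# Hypothetical couplings chosen to land in each regime.
for lam in (0.01, 0.05, 0.2):
    print(lam, transition_type(0.3, lam))
```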
Fig. 4.11.


4.4.5 Electroweak phase transition 


The above considerations can easily be generalized to study the electroweak phase
transition in the early universe. In electroweak theory, the equation for the scalar
field is obtained by variation of the $\chi$-dependent terms in the electroweak Lagrangian
given in (4.59), (4.79) and (4.84). If we assume that the Higgs mass is small and
neglect the scalar particles, then the equation for the homogeneous field $\bar\chi$ is

$$\bar\chi^{;\alpha}{}_{;\alpha} + V'(\bar\chi) - \frac{g^2 + g'^2}{4}\bar\chi\langle Z_\mu Z^\mu\rangle - \frac{g^2}{2}\bar\chi\langle W^+_\mu W^{-\mu}\rangle + f_t\langle\bar t t\rangle = 0. \tag{4.131}$$

We have retained here only the top quarks, which dominate over the contributions
from the other fermions because of their large Yukawa coupling constant $f_t$. The
contributions of Z and W bosons to $V_{\mathrm{eff}}$ can immediately be written down using
the formulae derived in the previous section. We simply note that the charged W
bosons have twice as many degrees of freedom as the neutral vector field and hence
give twice the contribution to $V_{\mathrm{eff}}$.
Problem 4.21 Verify that $f_t\langle\bar t t\rangle = \partial V_t/\partial\bar\chi$, where

$$V_t = 3 \times 4\left(-\frac{I(m_t)}{16\pi^2} + \frac{T^4}{4\pi^2}F_+\!\left(\frac{m_t}{T}\right)\right). \tag{4.132}$$

Here $m_t = f_t\bar\chi$ and

$$F_+\!\left(\frac{m_t}{T}\right) \equiv \int_0^{m_t/T}\alpha J_+^{(1)}(\alpha, 0)\,d\alpha. \tag{4.133}$$

The integrals $I(m_t)$ and $J_+^{(1)}(\alpha, 0)$ are defined in (4.104) and (3.34) respectively.
The factor 3 in (4.132) accounts for the three different colors of t quarks, the factor
4 for four degrees of freedom of the fermions of each color, and the negative sign
in front of $I$ indicates that the vacuum energy density of fermions is negative.


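Using this result we obtain

$$V_{\mathrm{eff}} = V + \frac{3}{64\pi^2}\left(m_Z^4\ln\frac{m_Z^2}{\mu^2} + 2m_W^4\ln\frac{m_W^2}{\mu^2} - 4m_t^4\ln\frac{m_t^2}{\mu^2}\right) + \frac{3T^4}{4\pi^2}\left[F_-\!\left(\frac{m_Z}{T}\right) + 2F_-\!\left(\frac{m_W}{T}\right) + 4F_+\!\left(\frac{m_t}{T}\right)\right], \tag{4.134}$$

where

$$m_Z = \frac{\sqrt{g^2 + g'^2}}{2}\bar\chi, \qquad m_W = \frac{g}{2}\bar\chi, \qquad m_t = f_t\bar\chi.$$

This formula resembles (4.116) and the numerical coefficients in front of the different
terms can easily be understood by counting the number of degrees of freedom
of the corresponding fields.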


Problem 4.22 Using the same normalization conditions as in Problem 4.19, verify
that at $T = 0$ K the effective potential is given by (4.117) and (4.118), where we
have to substitute

$$e^4 \to \frac{M_Z^4 + 2M_W^4 - 4M_t^4}{\chi_0^4}.$$

Since in the broken symmetry phase the masses of the gauge bosons and t quark are
$M_Z = m_Z(\chi_0) \simeq 91.2$ GeV, $M_W \simeq 80.4$ GeV and $M_t \simeq 170$ GeV, this combination
of masses is negative and the Linde–Weinberg arguments do not lead to a lower
bound on the Higgs mass. However, in this case the top quark contribution causes
the coefficient in front of the logarithmic term in (4.117) to be negative. At very
large $\bar\chi$ this term dominates and the potential becomes negative and unbounded
from below.
Taking different values for the Higgs mass (10 GeV, 30 GeV, 100 GeV), find out
when the potential $V_{\mathrm{eff}}$ becomes negative. In this way, assuming that the standard
electroweak theory is valid up to the scale $\bar\chi_m$, and requiring the absence of the
dangerous second minimum in $V_{\mathrm{eff}}$ at very large $\bar\chi > \chi_0$, one can obtain a lower
bound on the Higgs mass. However, this bound is not as robust as the
Linde–Weinberg bound.

The high-temperature expansion of the potential (4.134) is derived using the
same methods as for (4.122). The result is

$$V_{\mathrm{eff}}(\bar\chi, T) \simeq \frac{\lambda_T}{4}\bar\chi^4 - \frac{\Theta T}{3}\bar\chi^3 + \frac{\Upsilon\left(T^2 - T_0^2\right)}{2}\bar\chi^2 + \Lambda, \tag{4.135}$$

where the temperature-dependent coupling constant

$$\lambda_T = \frac{m_H^2}{2\chi_0^2} + \frac{3}{16\pi^2\chi_0^4}\left(M_Z^4\ln\frac{bT^2}{M_Z^2} + 2M_W^4\ln\frac{bT^2}{M_W^2} - 4M_t^4\ln\frac{b_FT^2}{M_t^2}\right) \tag{4.136}$$

is expressed through the masses of the gauge bosons and t quark in the broken
symmetry phase, e.g. $M_Z = m_Z(\chi_0)$. The constant $b$ is defined in (4.123) and
$\ln b_F = 2\ln\pi - 2\mathbf{C} \simeq 1.14$.

The dimensionless constants $\Theta$ and $\Upsilon$ are

$$\Theta = \frac{3\left(M_Z^3 + 2M_W^3\right)}{4\pi\chi_0^3} \simeq 2.7 \times 10^{-2}, \qquad \Upsilon = \frac{M_Z^2 + 2M_W^2 + 2M_t^2}{4\chi_0^2} \simeq 0.3,$$

and the temperature

$$T_0^2 = \frac{1}{2\Upsilon}\left(m_H^2 - \frac{3\left(M_Z^4 + 2M_W^4 - 4M_t^4\right)}{8\pi^2\chi_0^2}\right) \simeq 1.7\left(m_H^2 + (44\ \mathrm{GeV})^2\right) \tag{4.137}$$

depends explicitly on the unknown Higgs mass.

The terms due to the vector bosons are of the same type as in (4.122). This
becomes clear upon rewriting (4.122)-(4.124) using $M_G = e\chi_0$ instead of the
coupling constant $e$. To find the contribution of the fermions we have used the
high-temperature expansion (3.44) for $J_+^{(1)}$. Note that this expansion does not contain a
nonanalytic term which would contribute to the numerical coefficient $\Theta$ in front of
the $\bar\chi^3$ term.

Now we turn to the temperature behavior of potential (4.134) and study the
symmetry breaking in the early universe. To get an idea of the expected character
of the transition it is enough to consider the high-temperature expansion (4.135).

For a given temperature $T$, the contribution from different fields to (4.135)
can be trusted only for those $\bar\chi$ for which the induced masses of the corresponding
fields are smaller than the temperature. For instance, the t quark terms should be
retained in (4.135) only when $m_t(\bar\chi) = M_t\bar\chi/\chi_0 < T$, that is, for $\bar\chi < (\chi_0/M_t)\,T$.
Within the range

$$\frac{T}{M_{Z,W}}\chi_0 > \bar\chi > \frac{T}{M_t}\chi_0, \tag{4.138}$$

the Z and W bosons are relativistic, while the t quarks are nonrelativistic and hence
their contribution should be omitted. As we have mentioned above, for very small
$\bar\chi$ our derivation fails and we have to use more refined methods.

The analysis of the potential behavior follows almost exactly the consideration
in the previous section. At very high temperatures, namely

$$T > T_1 = \frac{T_0}{\sqrt{1 - (\Theta^2/4\Upsilon\lambda_{T_1})}}, \tag{4.139}$$

the potential has one minimum only, at $\bar\chi = 0$, the symmetry is restored and gauge
bosons and fermions are massless. As the temperature drops below $T_1$, the second
minimum appears and at

$$T_c = \frac{T_0}{\sqrt{1 - (2\Theta^2/9\Upsilon\lambda_{T_c})}} \tag{4.140}$$

the depth of this second minimum, located at

$$\bar\chi_c = \frac{2\Theta T_c}{3\lambda_{T_c}}, \tag{4.141}$$

becomes the same as that of the minimum at $\bar\chi = 0$. Subsequently, the transition
to the broken symmetry phase becomes possible. As noted above, the minimum
at $\bar\chi_c$ is separated from $\bar\chi = 0$ by the barrier only if $\bar\chi_c/2 > T_c/\sqrt{24}$. Hence, we
expect a strong first order phase transition only if $\lambda_{T_c} < \sqrt{8/3}\,\Theta$. Using (4.136),
this condition can be rewritten in terms of the upper bound on the Higgs mass:
$m_H < 75$ GeV. Thus, only if the Higgs particles were light enough would the
electroweak phase transition be first order. For $m_H = 50$ GeV, this would occur at
temperature $T_c \simeq 88$ GeV.

The experimental bound on the Higgs mass is $m_H > 114$ GeV. Therefore in
reality we expect that the breaking of electroweak symmetry happens smoothly.
For large Higgs masses one can simply neglect the $\bar\chi^3$ term in (4.135). In this case,
when the temperature drops below $T_0$ the only minimum of the potential is located
at

$$\bar\chi_c = \left(\frac{\Upsilon}{\lambda_T}\left(T_0^2 - T^2\right)\right)^{1/2} \tag{4.142}$$

and the symmetry is broken. The transition is a cross-over with no dramatic
cosmological consequences; in particular, there is no large deviation from thermal
equilibrium. The temperature at transition depends on the Higgs mass and it follows from
(4.137) that, for instance, $T_0 \simeq 166$ GeV for $m_H \simeq 120$ GeV and $T_0 \simeq 240$ GeV
for $m_H \simeq 180$ GeV. The above estimates are in good agreement with the results
of more rigorous and elaborate calculations.
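
The numbers quoted from (4.137) are easy to reproduce. The short Python sketch below evaluates both the rounded form $T_0^2 \simeq 1.7(m_H^2 + (44\ \mathrm{GeV})^2)$ and the unrounded expression. The masses are those quoted in Problem 4.22; the value $\chi_0 = 246$ GeV and all function names are our assumptions, so the unrounded result differs from the rounded one by a few per cent.

```python
import math

M_Z, M_W, M_T = 91.2, 80.4, 170.0  # GeV, masses quoted in the text
CHI_0 = 246.0                      # GeV, assumed vacuum expectation value


def upsilon(chi_0=CHI_0):
    """Dimensionless constant Upsilon entering (4.135)."""
    return (M_Z**2 + 2.0 * M_W**2 + 2.0 * M_T**2) / (4.0 * chi_0**2)


def t0_full(m_h, chi_0=CHI_0):
    """T_0 from the unrounded first form of (4.137), in GeV."""
    loop = 3.0 * (M_Z**4 + 2.0 * M_W**4 - 4.0 * M_T**4) / (8.0 * math.pi**2 * chi_0**2)
    return math.sqrt((m_h**2 - loop) / (2.0 * upsilon(chi_0)))


def t0_rounded(m_h):
    """T_0 from the rounded form T_0^2 = 1.7 (m_H^2 + (44 GeV)^2), in GeV."""
    return math.sqrt(1.7 * (m_h**2 + 44.0**2))


for m_h in (120.0, 180.0):
    print(m_h, t0_rounded(m_h), t0_full(m_h))
```

The rounded form gives roughly 166 GeV and 240 GeV for the two Higgs masses, matching the values quoted above.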


4.5 Instantons, sphalerons and the early universe 


In gauge theories which incorporate the Higgs mechanism the vacuum has a non- 
trivial structure in several respects. First of all, as we have seen in the preceding 
section, in the early universe the effective potential of the Higgs field can have two 
local minima. If the transition from the symmetrical phase to the broken symmetry 
phase happens when the false and true vacua are still separated by a barrier, the 
transition is first order and accompanied by bubble nucleation. Although this sit- 
uation seems not to occur in either quantum chromodynamics or the electroweak 
model, it is rather typical for unified field theories beyond the Standard Model. 

Another interesting aspect of non-Abelian gauge theories is the existence of 
topologically different vacua. They are also separated by a barrier and topological 
transitions can take place in the early universe. These transitions are very important 
because they lead to anomalous nonconservation of fermion number in the Standard 
Model. 

If the temperature at the time of transition is small compared to the height of the 
potential barrier, the transitions between different vacua occur as a result of subbar- 
rier quantum tunneling. In this case the Euclidean solution of the field equations, 
called an instanton, gives the dominant contribution to the tunneling probability. 
On the other hand, if the temperature is high enough, thermal fluctuations can take 
the field over the barrier to the other vacuum without tunneling. In this situation the 
transitions are classical and their rate is determined by the static field configuration 
corresponding to the maximum of the potential, called a sphaleron. 

In this section we will calculate the rates for the false vacuum decay and for topo- 
logical vacuum transitions and determine under which conditions the sphalerons 
dominate the instantons and vice versa. 


4.5.1 Particle escape from a potential well 


To start with, let us consider as a “warm-up” the one-dimensional problem of a
particle of mass $M$ escaping from a potential well at $q_0 = 0$ (Figure 4.12). In this
simple case we meet the main concepts we need for analyzing transitions in field
theory.

Fig. 4.12.


First, we neglect the thermal fluctuations and assume that the particle, with fixed
energy $E < V(q_m)$, is initially localized in the potential well. The only way to
escape from this well is via subbarrier tunneling. If the tunneling probability is
small, the energy $E$ is an approximate eigenvalue of the Hamiltonian, and we can
use the stationary Schrödinger equation

$$\left(-\frac{1}{2M}\frac{\partial^2}{\partial q^2} + V(q)\right)\Psi \simeq E\Psi \tag{4.143}$$

to estimate the tunneling amplitude. The approximate semiclassical solution of this
equation is

$$\Psi \propto \exp\left(i\int\sqrt{2M(E - V)}\,dq\right). \tag{4.144}$$

In the classically allowed regions, where $E > V$, the wave function simply
oscillates, while under the barrier it decays exponentially. Hence, in the region
$q > b(E)$, the wave function is suppressed by the factor

$$\exp\left(-\int_{a(E)}^{b(E)}\sqrt{2M(V - E)}\,dq\right) \tag{4.145}$$

compared to its value inside the potential well. Expression (4.145) accounts for the
dominant exponential contribution to the tunneling amplitude.
Instantons In the special case of $E = 0$ (the particle is at rest at the local minimum
of the potential) we have

$$\int_0^{b(0)}\sqrt{2MV}\,dq = \int_{-\infty}^{\tau_b}\left(\frac{1}{2}M\dot q^2 + V\right)d\tau \equiv S_{b(0)}, \tag{4.146}$$

where $q(\tau)$ satisfies the equation

$$M\ddot q + \frac{\partial(-V)}{\partial q} = 0. \tag{4.147}$$

Integrating (4.147) once gives

$$\frac{1}{2}M\dot q^2 - V = 0.$$

Taking into account time-translational invariance, we can set $\tau_b = 0$ in (4.146)
without loss of generality.

Equation (4.147) describes the motion of the particle in the inverted potential
$(-V)$ (Figure 4.12), and can be obtained from the original equation of motion if
we make the Wick rotation $t \to \tau = it$. Note that the formal substitution $t = -i\tau$
converts the Minkowski metric

$$ds^2 = dt^2 - dx^2$$

to the Euclidean metric

$$-ds_E^2 = d\tau^2 + dx^2.$$

Therefore, $\tau$ is called “Euclidean time.” The right hand side of (4.146) is the
Euclidean action calculated for the trajectory satisfying (4.147), with boundary
conditions $q(\tau \to -\infty) = 0$ and $q(\tau = 0) = b(0)$. We can “close” this trajectory
by considering the “motion” back to $q = 0$ as $\tau \to +\infty$. The corresponding
solution of (4.147), with boundary conditions $q(\tau \to \pm\infty) = 0$, is a baby version
of the Euclidean field theory solutions called instantons. It is clear from symmetry
that the instanton action $S_I$ is just twice the action $S_{b(0)}$. Hence, for the ground
state ($E = 0$), the dominant contribution to the tunneling probability, which is the
square of the amplitude (4.145), is

$$P_I \propto \exp(-S_I). \tag{4.148}$$
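
To make (4.146)-(4.148) concrete, the following Python sketch integrates $\sqrt{2MV}$ under the barrier for a toy potential and doubles the result to obtain the instanton action. The potential $V(q) = q^2(1 - q)/2$ with $M = 1$, the midpoint rule, and the function names are our illustrative choices; for this potential the integral can be done by hand, $S_{b(0)} = 4/15$, which gives a check on the numerics. The toy action is of order unity, so it only illustrates the bookkeeping; the semiclassical estimate itself requires $S_I \gg 1$.

```python
import math


def half_instanton_action(V, b, M=1.0, n=200_000):
    """Midpoint-rule estimate of S_b(0) = int_0^b sqrt(2 M V) dq, eq. (4.146)."""
    h = b / n
    total = 0.0
    for k in range(n):
        q = (k + 0.5) * h
        total += math.sqrt(2.0 * M * max(V(q), 0.0))
    return total * h


def toy_potential(q):
    """Hypothetical well at q = 0 with V(0) = 0 and turning point b(0) = 1."""
    return 0.5 * q**2 * (1.0 - q)


S_half = half_instanton_action(toy_potential, b=1.0)
S_I = 2.0 * S_half        # instanton action: there and back again
P_I = math.exp(-S_I)      # dominant exponential factor (4.148)
print(S_half, S_I, P_I)
```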


Thermal fluctuations and sphalerons Now we consider a particle in equilibrium
with a thermal bath of temperature $T$. The particle can acquire energy from the
thermal bath and if this energy exceeds the height of the barrier the particle escapes
from the potential well classically, without needing to “go under the barrier.” The
probability that the particle gets energy $E$ is given by the usual Boltzmann factor
$\propto \exp(-E/T)$. Taking into account that for $E < V(q_m)$ the tunneling amplitude is
given by (4.145), we obtain the following result for the total probability of escape:

$$P \propto \sum_E \exp\left(-\frac{E}{T} - 2\vartheta\int_{a(E)}^{b(E)}\sqrt{2M(V - E)}\,dq\right), \tag{4.149}$$

where $\vartheta = 1$ for $E < V(q_m)$ and $\vartheta = 0$ otherwise. The sum in (4.149) can be
estimated using the saddle point approximation. Taking the derivative of the expression
in the exponent, we find that this expression has its maximum value when $E$ satisfies
the equation

$$\frac{1}{T} = 2\int_{a(E)}^{b(E)}\sqrt{\frac{M}{2(V - E)}}\,dq = 2\int_{a(E)}^{b(E)}\frac{dq}{\dot q}. \tag{4.150}$$

Here $\dot q = dq/d\tau$ is the “Euclidean velocity” along the trajectory with total
Euclidean energy $-E$. The term on the right hand side of (4.150) is equal to the period
of oscillation in the inverted potential $(-V)$. Hence, for a given temperature $T$,
the dominant contribution to the escape probability comes from the periodic Euclidean
trajectory describing oscillations with period $1/T$ in the potential $(-V)$.

Let us consider two limiting cases. From (4.149) and (4.150), it follows that
for $T \ll V(q_m)/S_I$ the main contribution to the escape probability comes from
the subbarrier trajectory with $E \ll V(q_m)$ and as a result the rate of escape is
determined by the instanton. In the opposite case of very high temperatures,

$$T \gg \frac{V(q_m)}{S_I},$$

the “period of oscillation” tends to zero; hence the dominant trajectory comes
very close to the top of the potential $V$ and has energy $E \approx V(q_m)$. The unstable
static solution $q = q_m$, which corresponds to the maximum of $V$, is a prototype of
the field theory solutions called sphalerons (from the Greek for “ready to fall”). For
the sphaleron, the second term in the exponent in (4.149) can be neglected and the
escape probability is given by

$$P \propto \exp\left(-\frac{E_{sph}}{T}\right), \tag{4.151}$$

where $E_{sph} = V(q_m)$ is the energy (or mass) of the sphaleron. We would like to
stress that at very high temperatures the main contribution to the escape probability
comes from the states which surmount the barrier classically. For $T \gg E_{sph}$ there is
no exponential suppression and the particle very quickly leaves the potential well.

So far we have considered a particle with only one degree of freedom. For a
system with $N$ degrees of freedom the potential can depend on all coordinates
$\mathbf{q} \equiv (q_1, q_2, \ldots, q_N)$. The generalization to this case, however, is rather
straightforward. To calculate the tunneling probability at low temperatures we have to find an
instanton with the minimal Euclidean action $S_I$. If the energy is normalized so that
$V = 0$ at the bottom of the potential well, the dominant contribution to the
tunneling probability is proportional to $\exp(-S_I)$. At very high temperatures we need
to find the local extrema of the potential through which the particle can escape.
The extremum with the minimal value of the potential determines the dominant
contribution to the escape probability. This probability is given by (4.151), where
$E_{sph}$ is the value of the potential at the corresponding extremum.


4.5.2 Decay of the metastable vacuum 


We consider a real scalar field in this section using the standard notation $\varphi$ instead of
$\chi$ and for simplicity neglect the expansion of the universe. Let us assume that at the
time relevant for the transition the potential $V(\varphi)$ has the shape shown in Figure 4.13.
For convenience we normalize the energy such that $V(0) = 0$ and $V(\varphi_0) = -\epsilon < 0$.
Obviously the state $\varphi = 0$ is metastable and decays. If the transition takes place
efficiently when the temperature is small, we can neglect thermal fluctuations and
the metastable vacuum decays via quantum tunneling. If, on the other hand, thermal
fluctuations are not negligible, they can push the field to the top of the potential
and the transition occurs classically without tunneling. In both cases critical bubbles
are formed, filled with the new phase $\varphi \neq 0$. If the bubble nucleation rate exceeds
the expansion rate of the universe, the bubbles collide and finally fill all space with
the new phase. Let us now calculate the decay rate of the metastable (false) vacuum.

The scalar field $\varphi(\mathbf{x}, t)$ is a system with an infinite number of degrees of freedom.
We can treat the spatial coordinates $\mathbf{x}$ as a continuous index enumerating the degrees
of freedom. In this case $\varphi(\mathbf{x}, t) \equiv \varphi_{\mathbf{x}}(t)$ plays the same role as $q_n(t)$ in the preceding
discussion (the correspondence is obvious: $\varphi \Leftrightarrow q$ and $\mathbf{x} \Leftrightarrow n$). The action for
the scalar field can be written as

$$S = \int(\mathcal{K} - \mathcal{V})\,dt, \tag{4.152}$$

where

$$\mathcal{K} \equiv \frac{1}{2}\sum_{\mathbf{x}}\left(\frac{\partial\varphi_{\mathbf{x}}}{\partial t}\right)^2 \equiv \frac{1}{2}\int\left(\frac{\partial\varphi}{\partial t}\right)^2 d^3x \tag{4.153}$$

is the kinetic energy and

$$\mathcal{V}(\varphi_{\mathbf{x}}) \equiv \int\left(\frac{1}{2}(\partial_i\varphi)^2 + V(\varphi)\right)d^3x \tag{4.154}$$

is the potential energy or potential of the scalar field configuration $\varphi_{\mathbf{x}}$. As usual, $\partial_i$
denotes the partial derivative with respect to the spatial coordinate $x^i$ which, in the
language adopted in this section, is the derivative with respect to the continuous
index. We must stress that in the study of tunneling in field theory the potential $\mathcal{V}$,
and not the scalar field potential $V(\varphi)$, plays the role of $V(q)$ from the previous
discussion. To avoid confusion, the reader must distinguish between them carefully.
The potential $\mathcal{V}(\varphi_{\mathbf{x}})$ is a functional. It depends on the infinite number of variables
$\varphi_{\mathbf{x}}$ and takes a definite numerical value only when the field configuration $\varphi(\mathbf{x})$ is
completely specified.


Decay via instantons For the scalar field potential $V(\varphi)$, shown in Figure 4.13, the
state $\varphi(\mathbf{x}) = 0$ corresponds to a local minimum of the potential $\mathcal{V}$ with $\mathcal{V}(0) = 0$.
The other static configuration $\varphi(\mathbf{x}) = \varphi_0$ has negative energy $\mathcal{V}(\varphi_0) = -\epsilon \times$ volume.
Therefore the state $\varphi = 0$ is metastable and should decay. This decay can
be described, in complete analogy with the preceding discussion, as an “escape”
via tunneling of the infinitely many degrees of freedom from the local potential
well in $\mathcal{V}$. The dominant contribution to the semiclassical tunneling probability,
proportional to $\exp(-S_I)$, comes from the instanton with the action $S_I$. This
instanton “connects” the metastable vacuum $\varphi(\mathbf{x}) = 0$ to some (classically allowed)
configuration $\varphi_{\mathbf{x}}$ with $\mathcal{V}(\varphi_{\mathbf{x}}) < 0$, and satisfies the equation

$$\ddot\varphi_{\mathbf{x}}(\tau) + \frac{\delta(-\mathcal{V})}{\delta\varphi_{\mathbf{x}}} = 0, \tag{4.155}$$

where $\ddot\varphi_{\mathbf{x}} \equiv \partial^2\varphi/\partial\tau^2$ and

$$\frac{\delta(-\mathcal{V})}{\delta\varphi_{\mathbf{x}}} = \Delta\varphi - V_{,\varphi}$$

is the functional derivative of the inverted potential. Equation (4.155) is an analog
of (4.147) and it is obtained from the usual scalar field equation in Minkowski space
under Wick rotation $t \to \tau = it$.

The Euclidean action is finite for those solutions describing tunneling in which
the field $\varphi$ changes its value from $\varphi = 0$ to $\varphi \neq 0$ only within a bounded region in
space. On symmetry grounds one expects the most favorable emerging configuration
of the scalar field to be a bubble with $\varphi \neq 0$ at its center and $\varphi \to 0$ far away from
the center (Figure 4.14). To find the corresponding instanton relating the original
metastable vacuum configuration $\varphi(\mathbf{x}) = 0$ to a bubble filled with a new phase we
can again rely on symmetry. That is, we adopt the most symmetrical O(4)-invariant
solution of the Euclidean equation (4.155), which describes the four-dimensional
spherical “bubble” in Euclidean “spacetime.” The scalar field then depends only
on the radial coordinate

$$\tilde r = \sqrt{\mathbf{x}^2 + \tau^2}.$$

Fig. 4.14.

For $\varphi_{\mathbf{x}}(\tau) = \varphi(\tilde r)$, (4.155) simplifies to the ordinary differential equation

$$\frac{d^2\varphi}{d\tilde r^2} + \frac{3}{\tilde r}\frac{d\varphi}{d\tilde r} - \frac{\partial V}{\partial\varphi} = 0. \tag{4.156}$$

The solution of this equation, with boundary conditions $\varphi \to 0$ as $\tilde r \to \infty$ and
$d\varphi/d\tilde r = 0$ at $\tilde r = 0$, gives the desired instanton. The second boundary condition
is needed to avoid a singularity at the center of the bubble.

The vacuum decay rate per unit time and unit volume is

$$\Gamma \simeq A\exp(-S_I), \tag{4.157}$$

where $A$ is a pre-exponential factor that is very difficult to calculate. We can obtain
a rough estimate for $A$ using dimensional arguments. In units where $c = \hbar = 1$ and
$G \neq 1$, the decay rate has the dimension cm$^{-4}$ and hence

$$A \sim O\left(R_I^{-4}, V_{,\varphi\varphi}^2, \ldots\right), \tag{4.158}$$

where $R_I$ is the size of the instanton and one uses typical instanton values for the
derivatives of $V$. Usually all these quantities have the same order of magnitude.


Problem 4.23 Verify that the Euclidean action for the instanton can be reduced to

$$S_I = 2\pi^2\int\left(\frac{1}{2}\left(\frac{d\varphi}{d\tilde r}\right)^2 + V\right)\tilde r^3\,d\tilde r. \tag{4.159}$$

(Hint: The Euclidean action $S_E$ is related to the Lorentzian action $S_L$, given in
(4.152), by $S_L \to iS_E$ as $t \to \tau = it$.)


Decay via sphalerons The above consideration is applicable only at low or zero
temperature. If the temperature is very high (we will specify later what this means),
the dominant contribution to the vacuum decay is given by over-barrier classical
transitions. To estimate their rate we have to find the extremum of the potential $\mathcal{V}$
with the minimal possible energy. This extremum is reached for the static scalar
field configuration (sphaleron), satisfying the equation

$$\delta\mathcal{V}/\delta\varphi = -\Delta\varphi + V_{,\varphi} = 0. \tag{4.160}$$

For a spherical bubble this becomes

$$\frac{d^2\varphi}{dr^2} + \frac{2}{r}\frac{d\varphi}{dr} - \frac{\partial V}{\partial\varphi} = 0, \tag{4.161}$$

where $\varphi = \varphi(r)$ and $r = |\mathbf{x}|$ is the radial coordinate in three-dimensional space. The
boundary conditions we need to impose are similar to those for (4.156), namely,
$\varphi \to 0$ as $r \to \infty$ and $d\varphi/dr = 0$ at $r = 0$. Note that the sphaleron, in contrast
with the instanton, is the unstable static solution which depends only on the spatial
coordinates $\mathbf{x}$.

For transitions dominated by sphalerons, the decay rate per unit time and unit
volume is

$$\Gamma \simeq B\exp\left(-\frac{\mathcal{V}_{sph}}{T}\right), \tag{4.162}$$

where

$$\mathcal{V}_{sph} = 4\pi\int\left(\frac{1}{2}\left(\frac{d\varphi}{dr}\right)^2 + V\right)r^2\,dr \tag{4.163}$$

is the energy (or mass) of the sphaleron and $B \sim O(T^4, \ldots)$ is a further
pre-exponential factor.

Using the same reasoning as for particle escape, we find that for $T \ll \mathcal{V}_{sph}/S_I$
tunneling is more important and to estimate the vacuum decay rate one has to use
(4.157); otherwise, for $T \gg \mathcal{V}_{sph}/S_I$ the decay rate is given by (4.162).

Thus, to calculate the vacuum decay rate we have to find the solution of either
(4.156) or (4.161), depending on the temperature. One must usually resort to
numerical calculation; however, for a wide class of scalar field potentials it is possible
to find the explicit expression for $\Gamma$ without specifying the shape of the potential
$V$.


Thin wall approximation The first integral of (4.156) is

$$\frac{1}{2}\left(\frac{d\varphi}{d\tilde r}\right)^2 - V = \int_{\tilde r}^{\infty}\frac{3}{\tilde r'}\left(\frac{d\varphi}{d\tilde r'}\right)^2 d\tilde r', \tag{4.164}$$

where the boundary condition $\varphi \to 0$ as $\tilde r \to \infty$ has been used. Taking into account
the other boundary condition $(d\varphi/d\tilde r)_{\tilde r=0} = 0$, we obtain the useful relation

$$-V(\varphi_{\tilde r=0}) = \int_0^{\infty}\frac{3}{\tilde r'}\left(\frac{d\varphi}{d\tilde r'}\right)^2 d\tilde r'. \tag{4.165}$$

Now let us assume that the instanton, which is a bubble in four-dimensional
Euclidean “spacetime,” has a thin wall. This means that inside the bubble of radius $R_I$
the scalar field is nearly constant and equal to $\varphi(\tilde r = 0)$. The field $\varphi$ changes very
fast within the wall (a thin layer of width $2\Delta l \ll R_I$) and tends to zero outside
the bubble (Figure 4.14). Hence the integrand in (4.165) differs significantly from
zero only inside the wall, and this gives the dominant contribution to the integral.
Returning to (4.164) we see that the integral on the right hand side is suppressed
inside the wall by a factor $\Delta l/R_I \ll 1$ compared to $(d\varphi/d\tilde r)^2$ and $V$. Therefore, in
the leading approximation we have

$$(d\varphi/d\tilde r)^2 \approx 2V$$

for $R_I + \Delta l > \tilde r > R_I - \Delta l$. Using this result, (4.165) simplifies to

$$-V(\varphi_{\tilde r=0}) \approx \frac{3\sigma}{R_I}, \tag{4.166}$$

where

$$\sigma \equiv \int_0^{\infty}\left(\frac{d\varphi}{d\tilde r}\right)^2 d\tilde r \approx \int_0^{\varphi_{\tilde r=0}}\sqrt{2V}\,d\varphi \tag{4.167}$$

is the surface tension of the bubble. The instanton action (4.159) then becomes

$$S_I \approx -\frac{\pi^2}{2}\left|V(\varphi_{\tilde r=0})\right|R_I^4 + 2\pi^2\sigma R_I^3 \approx \frac{27\pi^2\sigma^4}{2\left|V(\varphi_{\tilde r=0})\right|^3}, \tag{4.168}$$

where the first and second terms are the contributions of the internal region of the
bubble and its wall respectively. For the potential $V$ shown in Figure 4.13 the action
takes the minimal value for $|V(\varphi_{\tilde r=0})| \approx \epsilon$. In this case the field inside the bubble is
equal to $\varphi_0$ and, as follows from (4.166), the instanton has size $R_I \approx 3\sigma/\epsilon$. Hence
the vacuum decay rate is equal to

$$\Gamma \simeq A\exp\left(-\frac{27\pi^2\sigma^4}{2\epsilon^3}\right). \tag{4.169}$$

At a “given moment of Euclidean time” $\tau$ the solution $\varphi\left(\sqrt{r^2 + \tau^2}\right)$ describes
a three-dimensional bubble. The half of the instanton connecting the metastable
vacuum with the classically allowed region “evolves” in Euclidean time as follows.
At $\tau \to -\infty$ we have $\varphi = 0$ everywhere in space. “Later on” at $\tau \sim -R_I$ the bubble
“appears” and its radius “grows” as $R(\tau) \approx \sqrt{R_I^2 - \tau^2}$, reaching the maximal value
$R_I$ at $\tau = 0$.
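
The thin wall results (4.166)-(4.169) can be checked numerically. The Python sketch below, with function names and sample values of $\sigma$ and $\epsilon$ that are purely illustrative, evaluates the action in (4.168) as a function of a trial radius $R$ and confirms that it is stationary at $R_I = 3\sigma/\epsilon$, where it equals $27\pi^2\sigma^4/2\epsilon^3$. Note that with respect to $R$ this stationary point is a maximum of the trial action, which reflects the fact that the instanton is a saddle point.

```python
import math


def thin_wall_action(R, sigma, eps):
    """Trial action of (4.168): interior term plus wall term, for |V| = eps."""
    return -0.5 * math.pi**2 * eps * R**4 + 2.0 * math.pi**2 * sigma * R**3


def thin_wall_instanton(sigma, eps):
    """Instanton size from (4.166) and its action from (4.168)."""
    R_I = 3.0 * sigma / eps
    S_I = 27.0 * math.pi**2 * sigma**4 / (2.0 * eps**3)
    return R_I, S_I


# Hypothetical values in units where the surface tension is 1.
sigma, eps = 1.0, 0.1
R_I, S_I = thin_wall_instanton(sigma, eps)
print(R_I, S_I, thin_wall_action(R_I, sigma, eps))
```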


Problem 4.24 Calculate the potential energy of this bubble and verify that

$$\mathcal{V}(R(\tau)) \approx \frac{2\pi}{3}\epsilon R(\tau)\left(R_I^2 - R^2(\tau)\right). \tag{4.170}$$

The total energy corresponding to the instanton solution is equal to zero. Therefore,
a bubble with radius $0 < R(\tau) < R_I$ is in the classically forbidden region (under
the barrier), where $\mathcal{V}(R) > 0$. A three-dimensional bubble of size $R_I \approx 3\sigma/\epsilon$
is on the border of the classically allowed region ($\mathcal{V}(R_I) = 0$) and it “materializes”
in Minkowski spacetime. The picture described is a very close analog of subbarrier
particle tunneling.

To understand when the thin wall approximation is valid we must determine
when the condition $\Delta l/R_I \ll 1$ is satisfied. If $V_m$ is the height of the scalar field
potential, the positive energy inside the wall of the “emerging bubble” is of order
$V_m R_I^2\Delta l$. This is exactly compensated by the negative energy inside the bubble,
$\sim \epsilon R_I^3$, where $-\epsilon$ is the global minimum of the potential. Therefore, $\Delta l/R_I \sim
\epsilon/V_m$ and hence the thin wall approximation is applicable only if $\epsilon/V_m \ll 1$.

The thin wall sphaleron which determines the decay rate at very high temperatures
can be found in a similar way from (4.161).


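Problem 4.25 Verify that the thin wall sphaleron has size $R_{sph} \approx 2\sigma/\epsilon$ and mass
equal to

$$\mathcal{V}_{sph} \approx \frac{16\pi\sigma^3}{3\epsilon^2}. \tag{4.171}$$

Thus, we find that if $\epsilon/V_m \ll 1$, the vacuum decay rate is about

$$\Gamma \simeq B\exp\left(-\frac{16\pi\sigma^3}{3\epsilon^2T}\right) \tag{4.172}$$

for $T \gg \mathcal{V}_{sph}/S_I \sim R_{sph}^{-1}$. On the other hand, for $T \ll R_{sph}^{-1} \sim \epsilon/\sigma$, the vacuum
decay rate is given by (4.169).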
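
A compact way to see how (4.171) and the two temperature regimes fit together is to evaluate the thin wall bubble energy directly. In the Python sketch below the static bubble energy is taken to be $4\pi\sigma R^2 - (4\pi/3)\epsilon R^3$ (wall energy minus interior energy, our reading of the thin wall limit of (4.163)); the function names, the sample values, and the simple rule of taking the smaller of the two exponents as a stand-in for the limits $T \ll$ and $T \gg \mathcal{V}_{sph}/S_I$ are all illustrative assumptions.

```python
import math


def bubble_energy(R, sigma, eps):
    """Thin wall energy of a static bubble: wall term minus interior term."""
    return 4.0 * math.pi * sigma * R**2 - (4.0 * math.pi / 3.0) * eps * R**3


def thin_wall_sphaleron(sigma, eps):
    """Sphaleron size and mass, as in Problem 4.25 and (4.171)."""
    R_sph = 2.0 * sigma / eps
    V_sph = 16.0 * math.pi * sigma**3 / (3.0 * eps**2)
    return R_sph, V_sph


def decay_exponent(T, sigma, eps):
    """Suppression exponent: S_I of (4.169) or V_sph / T of (4.172).

    Taking the minimum is only a crude interpolation between the two limits.
    """
    S_I = 27.0 * math.pi**2 * sigma**4 / (2.0 * eps**3)
    _, V_sph = thin_wall_sphaleron(sigma, eps)
    return min(S_I, V_sph / T)


# Hypothetical values in units where the surface tension is 1.
sigma, eps = 1.0, 0.1
R_sph, V_sph = thin_wall_sphaleron(sigma, eps)
S_I = 27.0 * math.pi**2 * sigma**4 / (2.0 * eps**3)
T_cross = V_sph / S_I  # of order eps / sigma
print(R_sph, V_sph, T_cross)
```

For these values the crossover temperature is $\mathcal{V}_{sph}/S_I = (32/81\pi)\,\epsilon/\sigma$, consistent with the estimate $R_{sph}^{-1} \sim \epsilon/\sigma$ used above.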

After a bubble has emerged in Minkowski spacetime its behavior can easily 
be found if we analytically continue an appropriate solution of (4.156) back to 
Minkowski spacetime. The corresponding function g(./r2 — t?) describes the ex- 
panding bubble. The field g is constant along the hypersurfaces r? — t? = const. 
Figure 4.15 shows these hypersurfaces and makes it clear that from the point of 
view of an observer at rest, the thickness of the wall decreases with time and the 
speed of the wall approaches the speed of light. 
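
As a rough illustration of this statement, the sketch below follows two surfaces
$r^2 - t^2 = \rho^2$ that bound the wall, with $\rho = R_0$ and $\rho = R_0 + \Delta l$. The numerical
values of $R_0$ and $\Delta l$ and the helper names are assumptions made for the example;
units are such that the speed of light is 1.

```python
import math

def wall_radius(t, rho):
    """Radius at time t of the surface r^2 - t^2 = rho^2."""
    return math.sqrt(rho**2 + t**2)

def wall_speed(t, rho):
    """dr/dt along that surface (units with c = 1)."""
    return t / wall_radius(t, rho)

def wall_thickness(t, r0, dl):
    """Distance between the surfaces rho = r0 and rho = r0 + dl at time t."""
    return wall_radius(t, r0 + dl) - wall_radius(t, r0)

r0, dl = 1.0, 0.1              # illustrative nucleation radius and wall width (assumed)
for t in (0.0, 1.0, 10.0, 100.0):
    print(t, wall_speed(t, r0), wall_thickness(t, r0, dl))
```

The printed speed grows monotonically toward 1 while the thickness shrinks, in line
with the behavior read off from Figure 4.15.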


Problem 4.26 Using the results of this section, derive and analyze the formulae 
describing first order phase transitions in the U(1) model considered in Section 
4.4.4. 


4.5.3 The vacuum structure of gauge theories 


In non-Abelian gauge theories the vacuum of gauge fields has a nontrivial structure. 
To understand how this comes about, we first consider pure SU(N) theory without 
fermions and scalar fields. The gauge field $F_{\mu\nu}$ should vanish in the vacuum. This
does not mean, however, that the vector potential $A_\mu$ also vanishes; the vanishing of
$F_{\mu\nu}$ only means that the vector potential is a gauge transform of zero (see (4.11)).

Fig. 4.15.


In particular, taking an arbitrary time-independent unitary matrix U(x), we find 
that 


$$A_0 = 0, \qquad A_i = \frac{i}{g}\,(\partial_i U)\,U^{-1} \tag{4.173}$$


also describes the vacuum. If an arbitrary U(x) could be continuously transformed 
to the unit matrix everywhere in space all vacua would be equivalent. However 
this is not the case. Instead the set of all functions U(x) can be decomposed into 
homotopy classes. We say that two functions belong to the same homotopy class if 
there exists a nonsingular continuous transformation relating them; otherwise the 
functions belong to different homotopy classes. 


Winding number To find and characterize the homotopy classes let us introduce 
the winding number, defined as 





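$$\nu \equiv -\frac{1}{24\pi^2}\int \mathrm{tr}\left(\epsilon^{ijk}\,(\partial_i U)U^{-1}(\partial_j U)U^{-1}(\partial_k U)U^{-1}\right)d^3x, \tag{4.174}$$

where $\epsilon^{ijk}$ is the totally antisymmetric Levi-Civita symbol: $\epsilon^{123} = 1$ and it changes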
sign upon permutation of any two indices. We will show first that this number is a 
topological invariant and second that it takes integer values characterizing different 
homotopy classes. To prove the first statement let us consider a small nonsingular 
variation $U \to U + \delta U$, and show that $\delta\nu = 0$. Taking into account that

$$(\delta U)\,U^{-1} = -U(\delta U^{-1})$$

and

$$(\partial_i U)\,U^{-1} = -U(\partial_i U^{-1}), \tag{4.175}$$

the variation of the first term in the integrand can be written as

$$\delta\left((\partial_i U)\,U^{-1}\right) = U\left(\partial_i(U^{-1}\delta U)\right)U^{-1}. \tag{4.176}$$

The variation of the other two terms gives a similar contribution, and therefore

$$\delta\nu \propto \int \mathrm{tr}\left(\epsilon^{ijk}\,\partial_i(U^{-1}\delta U)\,(\partial_j U^{-1})(\partial_k U)\right)d^3x. \tag{4.177}$$

Upon integration by parts there arise terms of the form $\epsilon^{ijk}\partial_i\partial_j\ldots$, which vanish
because of the antisymmetry of the $\epsilon$ symbol. Hence the winding number does not
change under continuous nonsingular transformations. 

To prove the existence of homotopy classes with different winding numbers, we 
construct them explicitly for the case of the SU(2) group. Any SU(2) matrix can 
be written as 


$$U(\chi, \mathbf{e}) = \cos\chi\,\mathbf{1} - i(\mathbf{e}\cdot\boldsymbol{\sigma})\sin\chi, \tag{4.178}$$


where $\mathbf{e} = (e_1, e_2, e_3)$ is the unit vector and $\boldsymbol{\sigma} = (\sigma_1, \sigma_2, \sigma_3)$ are the three Pauli
matrices combined as a vector.


Problem 4.27 Verify this last statement. (Hint: An arbitrary SU(2) matrix has the
same form as the matrix $\zeta$ in (4.47).)


Thus, the elements of the SU(2) group can be parameterized by the unit vector
in four-dimensional Euclidean space,

$$l_\alpha = (\cos\chi, \mathbf{e}\sin\chi),$$

and they can be thought of topologically as the elements of the three-dimensional
sphere. Let us take $\chi$ and $\mathbf{e}$ to be functions of the spatial coordinates $\mathbf{x}$ and identify the
points at spatial infinity ($|\mathbf{x}| \to \infty$). Then $U(\mathbf{x})$ is an unambiguous function of the
spatial coordinates if $\chi(|\mathbf{x}| \to \infty) = \pi m$ and $m$ is an integer or zero. We can interpret
the function $U(\mathbf{x})$ as describing the mapping from the 3-sphere $S^3$ (Euclidean space
$\mathbf{x}$ with infinity mapped to a point) to the 3-sphere of the elements of the SU(2)
group. For the mapping (4.178), the spatial coordinates $\mathbf{x}$ wrap around this sphere
$m$ times. The mappings with $+m$ and $-m$ correspond to different orientations and
therefore should be distinguished. Using the identity (4.175), (4.174) simplifies to


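$$\nu = -\frac{1}{8\pi^2}\int \mathrm{tr}\left\{U^{-1}(\partial_r U)\left[(\partial_\phi U^{-1})(\partial_\theta U) - (\partial_\theta U^{-1})(\partial_\phi U)\right]\right\}dr\,d\theta\,d\phi, \tag{4.179}$$

where $r$, $\theta$, $\phi$ are the spherical coordinates in $\mathbf{x}$ space. Let us assume for simplicity
that $\chi(\mathbf{x}) = \chi(r)$ and

$$\mathbf{e}(\mathbf{x}) = (\sin\theta\cos\phi,\ \sin\theta\sin\phi,\ \cos\theta).$$

Then, taking into account that $U^{-1}(\chi) = U(-\chi)$ and using the Pauli matrix property

$$\sigma_i\sigma_j = \delta_{ij} + i\epsilon_{ijk}\sigma_k,$$

from which follows $(\mathbf{e}\cdot\boldsymbol{\sigma})^2 = 1$, we obtain

$$U^{-1}(\partial_r U) = -i(\mathbf{e}\cdot\boldsymbol{\sigma})\,\frac{d\chi}{dr}, \tag{4.180}$$

$$(\partial_\phi U^{-1})(\partial_\theta U) - (\partial_\theta U^{-1})(\partial_\phi U) = -2i\sin^2\chi\,\sin\theta\,(\mathbf{e}\cdot\boldsymbol{\sigma}). \tag{4.181}$$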


Substituting these expressions into (4.179) and integrating over the angles, we 
derive the desired result: $\nu = m$. Thus the homotopy class characterized by the
winding number m corresponds to the mapping which wraps m times around the 
SU(2) sphere. 
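
The result $\nu = m$ can also be checked numerically. The sketch below is only an
illustration under stated assumptions: it takes the hedgehog map $U = \cos\chi - i(\mathbf{e}\cdot\boldsymbol{\sigma})\sin\chi$
with $\mathbf{e}$ the radial unit vector, assumes $\chi(0) = 0$ and $\chi(\infty) = \pi m$ so that the radial
integral becomes an integral over $\chi$ from 0 to $\pi m$, and evaluates
$\nu = -(1/8\pi^2)\int \mathrm{tr}\{U^{-1}(\partial_\chi U)[(\partial_\phi U^{-1})(\partial_\theta U) - (\partial_\theta U^{-1})(\partial_\phi U)]\}\,d\chi\,d\theta\,d\phi$
with finite differences and a midpoint rule. All helper names and grid sizes are
choices made for this example.

```python
import cmath
import math

def su2(chi, theta, phi):
    """U = cos(chi) - i (e.sigma) sin(chi), with e the radial unit vector."""
    c, s = math.cos(chi), math.sin(chi)
    ez = math.cos(theta)
    e_plus = math.sin(theta) * cmath.exp(1j * phi)      # e_x + i e_y
    return ((c - 1j * s * ez, -1j * s * e_plus.conjugate()),
            (-1j * s * e_plus, c + 1j * s * ez))

def mul(a, b):
    """Product of two 2x2 matrices stored as nested tuples."""
    return tuple(tuple(a[i][0] * b[0][j] + a[i][1] * b[1][j] for j in range(2))
                 for i in range(2))

def sub(a, b):
    """Difference of two 2x2 matrices."""
    return tuple(tuple(a[i][j] - b[i][j] for j in range(2)) for i in range(2))

def diff(f, h=1e-5):
    """Central finite difference of a matrix-valued function of one offset."""
    plus, minus = f(h), f(-h)
    return tuple(tuple((plus[i][j] - minus[i][j]) / (2 * h) for j in range(2))
                 for i in range(2))

def integrand(chi, theta, phi):
    """Real part of the trace appearing under the integral."""
    u_inv = su2(-chi, theta, phi)                        # U^{-1}(chi) = U(-chi)
    du_chi = diff(lambda d: su2(chi + d, theta, phi))
    du_th = diff(lambda d: su2(chi, theta + d, phi))
    du_ph = diff(lambda d: su2(chi, theta, phi + d))
    dinv_th = diff(lambda d: su2(-chi, theta + d, phi))
    dinv_ph = diff(lambda d: su2(-chi, theta, phi + d))
    bracket = sub(mul(dinv_ph, du_th), mul(dinv_th, du_ph))
    m = mul(mul(u_inv, du_chi), bracket)
    return (m[0][0] + m[1][1]).real

def winding_number(m, n_chi=16, n_theta=40, n_phi=2):
    """Midpoint-rule estimate of nu for chi running from 0 to pi*m."""
    if m == 0:
        return 0.0
    steps = n_chi * abs(m)
    d_chi = math.pi * m / steps                          # signed step in chi
    d_theta, d_phi = math.pi / n_theta, 2 * math.pi / n_phi
    total = 0.0
    for a in range(steps):
        for b in range(n_theta):
            for c in range(n_phi):
                total += integrand((a + 0.5) * d_chi, (b + 0.5) * d_theta,
                                   (c + 0.5) * d_phi)
    return -total * d_chi * d_theta * d_phi / (8 * math.pi**2)

for m in (1, 2, -1):
    print(m, winding_number(m))
```

Within the accuracy of the grid the estimate reproduces the integer $m$, including
its sign, which distinguishes the two orientations of the mapping.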

To what extent are these results particular to the SU(2) group? First of all note 
that every mapping of $S^3$ into the U(1) Abelian group is continuously deformable
to the trivial mapping. Hence the vacuum has a trivial structure in this case and
there is no analog of winding number. As for a non-Abelian SU(N) group, one can
show by methods beyond the scope of this book that any continuous mapping of $S^3$
into an arbitrary SU(N) group can be continuously deformed to a mapping into an 
SU(2) subgroup of SU(N). Therefore all results derived for the SU(2) group are 
valid for an arbitrary SU(N) group; in particular, (4.174) for the winding number 
requires no alteration. 


Barrier height Two vacua, with different winding numbers, are separated by a 
barrier. To demonstrate this we will use the identity 


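$$\mathrm{tr}(F\tilde F) = \partial_\alpha\,\mathrm{tr}\,\epsilon^{\alpha\beta\gamma\delta}\left(F_{\beta\gamma}A_\delta - \frac{2}{3}\,ig\,A_\beta A_\gamma A_\delta\right), \tag{4.182}$$

where $\tilde F^{\alpha\beta} \equiv \frac{1}{2}\epsilon^{\alpha\beta\gamma\delta}F_{\gamma\delta}$ is the tensor dual to $F$.


Problem 4.28 Verify (4.182).


Let us consider two vacuum configurations (4.173) with winding numbers $\nu_0$ and
$\nu_1$ specified on two different space-like hypersurfaces. Then, integrating (4.182)
and using the Gauss theorem, we obtain

$$\int \mathrm{tr}(F\tilde F)\,d^4x = \frac{16\pi^2}{g^2}\,(\nu_1 - \nu_0). \tag{4.183}$$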

Thus, the field configuration interpolating between two topologically different
vacua has a nonvanishing field strength and hence, “in between,” nonzero positive
potential energy. Because the energy of both the initial and final states is equal
to zero, the transition between the vacua can occur only as subbarrier tunneling
via an instanton. To find the corresponding instanton we make the Wick rotation
to Euclidean time: $t \to \tau = it$ and substitute $A_0 \to iA_0$; then $F_{0i} \to iF_{0i}$ and the
form $\mathrm{tr}F^2$ becomes nonnegative definite. By the Schwarz inequality,

$$\left(\int \mathrm{tr}(F^2)\,d^4x\right)\left(\int \mathrm{tr}(\tilde F^2)\,d^4x\right) \geq \left(\int \mathrm{tr}(F\tilde F)\,d^4x\right)^2. \tag{4.184}$$

Taking into account that $\mathrm{tr}(\tilde F^2) = \mathrm{tr}(F^2)$, this inequality together with (4.183)
implies the lower bound for the Euclidean action of any field configuration connecting
the two vacua:

$$S_E = \frac{1}{2}\int \mathrm{tr}(F^2)\,d^4x \geq \frac{8\pi^2}{g^2}\,|\nu_1 - \nu_0|. \tag{4.185}$$

The equality in (4.184) is attained if and only if $F = \pm\tilde F$. The corresponding
interpolating solution with $\Delta\nu = 1$ is called the instanton. We do not need the explicit
form of this solution here, but merely point out that it is characterized by a single
parameter (a constant of integration), the instanton size $\rho$. The instanton action
does not depend on the size and for any $\rho$ is equal to

$$S_I = \frac{8\pi^2}{g^2} = \frac{2\pi}{\alpha}, \tag{4.186}$$

where $\alpha \equiv g^2/4\pi$ is the corresponding “fine structure constant.”


Topological transitions Thus, the vacuum of a gauge theory generally has a com- 
plicated structure with many minima separated by potential barriers as shown in 
Figure 4.16. (This picture is, of course, no more than a symbolic representation of 
the vacuum structure and should not be taken too literally.) The instanton connects 
two adjacent minima. The probability of tunneling is proportional to $\exp(-2S_I)$
because, as opposed to particle tunneling, the instanton interpolates between the
initial and final states only once. Hence the transition rate between topologically
different vacua is

$$\Gamma \propto \exp\left(-\frac{4\pi}{\alpha}\right). \tag{4.187}$$

In electroweak theory, $\alpha_w \simeq 1/29$ and we have $\Gamma \propto 10^{-160}$. Therefore, instanton
transitions are strongly suppressed in electroweak theory.

Fig. 4.16.


The existence of instantons of arbitrary size is not surprising since pure Yang-Mills
theory is scale-invariant. Naively integrating over all instantons leads to a
divergent transition probability. Moreover, the height of the barrier between two
minima, which can be estimated on dimensional grounds as

$$V_m \sim \frac{S_I}{\rho}, \tag{4.188}$$

tends to zero as the instanton size $\rho$ grows. Hence, one might expect the transition
rate to be already very large at very small temperatures. In reality this does
not happen because our consideration fails for instantons of large size. In fact, in
pure non-Abelian Yang-Mills theory the gauge fields are confined and, hence, the
instanton size cannot exceed the confinement scale.

Let us now turn to theories with the Higgs mechanism, where scale invariance is 
broken. For instance, in electroweak theory one has a natural infrared cut-off scale 
determined by the typical mass of the gauge bosons $M_W$, and a maximal instanton
size equal to

$$\rho_m \sim M_W^{-1}. \tag{4.189}$$

Vacuum transitions are strongly suppressed at zero temperature because the weak
coupling constant is small. At high temperatures the transition rate is determined
by sphalerons corresponding to the maximum of the potential (Figure 4.16). We
can make a rough estimate of the sphaleron mass by considering the height of the
potential in the “direction” where tunneling occurs via an instanton of the largest
possible size $\rho_m \sim M_W^{-1}$. Then

$$M_{sph} \sim V_m \sim S_I/\rho_m \sim \frac{2\pi}{\alpha_w}\,M_W \sim 15\ \mathrm{TeV}. \tag{4.190}$$

This estimate is in good agreement with more elaborate calculations according to
which $M_{sph} \simeq 7$-$13$ TeV. As in the case of metastable vacuum decay, the sphaleron
size $R_{sph}$ is comparable to the instanton size $\rho_m \sim M_W^{-1}$.
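
The two numbers quoted above follow from simple arithmetic, sketched here for
orientation. The value $\alpha_w = 1/29$ is taken from the text; the W boson mass of
80.4 GeV is an assumed input that the text does not state, and the function names
are illustrative.

```python
import math

ALPHA_W = 1.0 / 29.0      # weak fine structure constant quoted in the text
M_W_GEV = 80.4            # W boson mass in GeV (assumed input, not from the text)

def log10_instanton_rate(alpha):
    """log10 of exp(-4*pi/alpha), the suppression factor in (4.187)."""
    return -4.0 * math.pi / alpha / math.log(10.0)

def sphaleron_mass_tev(alpha, m_w_gev):
    """Rough estimate (4.190): (2*pi/alpha) * M_W, converted to TeV."""
    return 2.0 * math.pi * m_w_gev / alpha / 1000.0

print(log10_instanton_rate(ALPHA_W), sphaleron_mass_tev(ALPHA_W, M_W_GEV))
```

With these inputs the exponent corresponds to a suppression of roughly 160 orders of
magnitude, and the sphaleron mass estimate comes out close to 15 TeV.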

At high temperatures the rate of topological transitions is proportional to
$\exp(-M_{sph}/T)$ and one expects that at $T > 10$ TeV they are no longer suppressed.
In fact, the transitions become very efficient at much smaller temperatures. We
have found that the expectation value of the Higgs field, and hence the masses of
the gauge bosons, decrease as the temperature increases. As a consequence the
height of the barrier, proportional to $M_W(T)$, also decreases. At the moment when
$M_W(T) \sim \alpha_w T$, the exponential suppression,

$$\exp\left(-\frac{M_{sph}(T)}{T}\right) \sim \exp\left(-\frac{M_W(T)}{\alpha_w T}\right),$$

disappears and the rate of transitions per unit volume per unit time is

$$\Gamma \sim (\alpha_w T)^4. \tag{4.191}$$

This estimate, based on dimensional grounds, is also roughly valid at higher
temperatures where symmetry is restored, the gauge bosons are massless and the barrier
disappears. In the absence of the barrier there are no sphalerons and transitions
occur via field configurations of typical size $\sim(\alpha_w T)^{-1}$. In electroweak theory the
symmetry is restored when the temperature exceeds ~100 GeV (see (4.142)) and 
the topological transitions are very efficient. This leads to nonconservation of the 
total fermion number. 


4.5.4 Chiral anomaly and nonconservation of the fermion number 


Chiral anomaly The gauge interactions of the massless fermions preserve both
left- and right-handed currents, $J_L^\mu = \bar\psi_L\gamma^\mu\psi_L$ and $J_R^\mu = \bar\psi_R\gamma^\mu\psi_R$, at the classical
level. This means that the fermion numbers (equal to the difference between the
numbers of fermions and antifermions) are conserved for each helicity separately.
As a consequence, both the total current $J^\mu = J_L^\mu + J_R^\mu$ and the chiral current $J_5^\mu =
J_R^\mu - J_L^\mu$ are conserved. In electrodynamics the conservation of the total current is
equivalent to charge conservation and its violation would be a disaster. On the other
hand, the violation of the chiral current would have no dramatic consequences.
In fact, for massive fermions the chiral current is not conserved even classically.
Quantum fluctuations lead to a violation of chiral current conservation for massless
fermions also. In quantum electrodynamics the triangle diagram, shown in Figure
4.17, induces the chiral anomaly:

$$\partial_\mu J_5^\mu = \frac{e^2}{8\pi^2}\,F\tilde F. \tag{4.192}$$

Fig. 4.17.


The situation is very similar in non-Abelian gauge theories, where the divergences
of the left- and right-handed currents are given by

$$\partial_\mu J_L^\mu = -c_L\,\frac{g^2}{16\pi^2}\,\mathrm{tr}(F\tilde F), \qquad \partial_\mu J_R^\mu = c_R\,\frac{g^2}{16\pi^2}\,\mathrm{tr}(F\tilde F), \tag{4.193}$$

respectively. In a theory where the right- and left-handed fermions are coupled to
the gauge field with the same strength, as for instance in quantum chromodynamics,
$c_L = c_R = 1$ for every flavor, and the total current and, as a result, the fermion number
are conserved. On the other hand, the difference between the numbers of right- and
left-handed fermions,

$$Q_5 = \int (J_R^0 - J_L^0)\,d^3x = N_R - N_L, \tag{4.194}$$

changes in instanton transitions. Using Gauss’s theorem and taking into account
(4.183), we obtain from (4.193)

$$\Delta Q_5 = Q_5^f - Q_5^i = \frac{g^2}{8\pi^2}\int \mathrm{tr}(F\tilde F)\,d^4x = 2, \tag{4.195}$$

that is, the corresponding fermion flips its helicity in the instanton.


Violation of fermion number The situation is more interesting in chiral theories, 
where the right- and left-handed particles are coupled to the gauge fields differ- 
ently. Let us consider electroweak theory at temperatures T > 100 GeV, where the 
symmetry is restored and the rate of topological transitions is very high. The SU(2) 
gauge fields interact only with left-handed fermions and have the same strength for 
each doublet; therefore $c_L = 1$, $c_R = 0$ and

$$\partial_\mu J_f^\mu = -\frac{g^2}{16\pi^2}\,\mathrm{tr}(F\tilde F), \tag{4.196}$$

where $f$ indicates the corresponding fermion doublet and runs from 1 to 12. For
instance, $f = 1$ for the first lepton family,

$$J_1^\mu = \bar e_L\gamma^\mu e_L + \bar\nu_e\gamma^\mu\nu_e, \tag{4.197}$$

and $f$ goes from 4 to 12 for three quark families, each of which is represented in
three different colors. From (4.196) and (4.183), it follows that during a topological
transition in which the winding number increases by $\Delta\nu$ units, the fermion number
in each doublet decreases by $\Delta\nu$ units and hence the total fermion number is not
conserved. Taking into account that there are nine quark doublets and that the
baryon number of every quark is equal to 1/3, we obtain the following selection
rule:

$$\Delta L_e = \Delta L_\mu = \Delta L_\tau = \frac{1}{3}\,\Delta B, \tag{4.198}$$

where $\Delta L_i$ is the change of the lepton number in the corresponding family and $\Delta B$
is an overall change of the baryon number. Of course, the conservation laws for
energy, total electric charge and color should be fulfilled. In an instanton/sphaleron
transition the total number of fermions decreases by twelve units; correspondingly
the total lepton and baryon numbers decrease by three units each: $\Delta L = \Delta B = -3$.
The energy of the disappearing fermions is transferred to the remaining and newly 
created fermions and antifermions. One of the possible processes of this kind is 
shown in Figure 4.18. 
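
The bookkeeping behind the selection rule (4.198) can be made explicit with a small
sketch. It simply counts doublets as described above (three lepton doublets, nine
quark doublets, baryon number 1/3 per quark); the function name and the returned
layout are illustrative.

```python
from fractions import Fraction

def fermion_number_changes(delta_nu):
    """Lepton and baryon number changes when the winding number grows by delta_nu."""
    per_doublet = -delta_nu                      # each left-handed doublet loses delta_nu fermions
    lepton = {family: per_doublet for family in ("e", "mu", "tau")}
    quark_doublets = 3 * 3                       # three families times three colors
    baryon = Fraction(1, 3) * quark_doublets * per_doublet
    return lepton, baryon

lepton, baryon = fermion_number_changes(1)
total_lepton = sum(lepton.values())
print(lepton, baryon, baryon - total_lepton)
```

For a single transition this gives a change of $-1$ in each lepton family and $-3$ in
baryon number, so that each $\Delta L_i$ equals $\Delta B/3$ and $B - L$ is unchanged.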

Thus in chiral theories topological transitions lead to nonconservation of the
total number of left-handed fermions. In electroweak theory there exist interactions
which convert left-handed particles to right-handed ones. Hence the total fermion
number is not conserved and some linear combination of baryon and lepton number,
$B + aL$, should vanish at thermal equilibrium. The numerical coefficient $a$ is of
order unity and its exact value can be found taking into account the conservation
laws and analyzing the conditions for the chemical equilibrium of all particles
involved. In the Standard Model with three generations of fermions and one Higgs
doublet, $a = 28/51$. On the other hand, it follows from (4.198) that the charges
$L_i - (1/3)B$ and, as a result, $B - L$ are conserved.

Fig. 4.18.

Topological transitions in the early universe can ensure equilibrium only if their
rate per fermion, equal to $\Gamma/n_f \sim \alpha_w^4 T$ (see (4.191)), exceeds the expansion rate
$H \sim T^2$ (in Planck units), that is, they are efficient if $T < \alpha_w^4 \sim 10^{12}$ GeV. Thus, even
if $B + aL$ were generated in the very early universe it would be washed out by topological
transitions at $10^{12}$ GeV $> T > 10^2$ GeV. Hence if $B - L = 0$ then any pre-existent
baryon number does not survive. To explain baryon asymmetry, therefore, one has
to find a way to generate $B - L \neq 0$ in the very early universe. The other possibility
is to generate $B + aL$ during a violent electroweak phase transition. However, this
does not look very realistic because the electroweak transition seems to be rather
smooth.
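
The upper end of this temperature window can be estimated numerically. The sketch
below compares the rate per fermion $\alpha_w^4 T$ with the expansion rate written as
$H = (8\pi^3 g_*/90)^{1/2}\,T^2/M_{Pl}$. The Planck mass of $1.22\times 10^{19}$ GeV and the value
$g_* = 106.75$ (all Standard Model species relativistic) are assumed inputs that do
not appear in the text.

```python
import math

M_PLANCK_GEV = 1.22e19    # Planck mass in GeV (assumed input)
ALPHA_W = 1.0 / 29.0      # weak fine structure constant quoted in the text
G_STAR = 106.75           # assumed number of relativistic degrees of freedom

def max_equilibrium_temperature(alpha, g_star):
    """Temperature in GeV below which alpha^4 * T exceeds the expansion rate H."""
    hubble_coeff = math.sqrt(8.0 * math.pi**3 * g_star / 90.0)
    return alpha**4 * M_PLANCK_GEV / hubble_coeff

print(max_equilibrium_temperature(ALPHA_W, G_STAR))
```

With these assumptions the result is of order $10^{12}$ GeV; dropping the numerical
coefficient in $H$ would raise it by about one order of magnitude.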


4.6 Beyond the Standard Model 


Particle theories have been probed experimentally only to an energy scale of order a 
few hundred GeV. If we want to learn anything about the early universe at energies 
above 100 GeV, we inevitably have to rely on theories which are somewhat
speculative. Fortunately they all have common features which may allow us to foresee
possible solutions of important cosmological problems. Among these problems are
the origin of baryon asymmetry in the universe, the nature of dark matter and the 
mechanism for inflation. We devote a separate chapter to inflation; here we concen- 
trate on the first two questions. Since these questions cannot be answered within 
the Standard Model, we are forced to go beyond it. In this section we begin by 
outlining the relevant general ideas behind the extensions of the Standard Model 
and then discuss the ways in which these ideas can be implemented in cosmology. 


Grand unification The SU(3) × SU(2) × U(1) Standard Model is characterized
by the three coupling constants $g_s$, $g$ and $g'$. They depend on the energy scale, and
the corresponding “fine structure constants” $\alpha \equiv g^2/4\pi$ run according to (4.29).
The strong interaction constant $\alpha_s$ is given by (4.31).

In the case of the SU(2) group the coefficient $f'(1)$ in (4.29) can be inferred
from (4.30). For $q \gg 100$ GeV all particles, including intermediate bosons, can
be treated as massless and the number of “colors” $n$ in (4.30) is equal to 2. The
number of “flavors” $f$ should be taken to be equal to half the number of left-handed
fermion doublets (not forgetting that there is a quark doublet for each color), that
is, $f = 12/2 = 6$. Hence, for the SU(2) group,

$$f'(1) = \frac{1}{12\pi}\,(2\times 6 - 11\times 2) \simeq -0.265$$

and

$$\alpha_w(q^2) = \frac{\alpha_w^0}{1 + 0.265\,\alpha_w^0\,\ln(q^2/q_0^2)}, \tag{4.199}$$

where $q_0 \sim 100$ GeV and $\alpha_w^0 \equiv \alpha_w(q_0) \simeq 1/29$. Comparing (4.31) with (4.199),
we find that the strong and weak coupling constants meet at $q \sim 10^{17}$ GeV. This
suggests that above $10^{17}$ GeV, strong and weak interactions may be unified in a large
gauge group and characterized by a single coupling constant $g_U$. The difference in
their strength at low energies is then entirely explained by the different running of
the coupling constants within the SU(3) and SU(2) subgroups, after the symmetry
of this larger gauge group is broken at $\sim 10^{17}$ GeV. The simplest single group
which incorporates all known fermions is the SU(5) group. It also contains a U(1)
subgroup and could therefore include all gauge interactions of the Standard Model.
However, to identify this subgroup with the U(1) of electroweak theory, one has to
verify that the properly normalized U(1) fine structure constant, $\alpha_1 \equiv (5/3)\,g'^2/4\pi$,
meets the other two constants at the correct energy scale. This is a highly nontrivial
task.
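
The meeting scale can be estimated with a short sketch. It uses the one-loop form
$1/\alpha(q) = 1/\alpha(q_0) - f'(1)\ln(q^2/q_0^2)$, which is equivalent to (4.199), with
$f'(1) = (2f - 11n)/12\pi$ as in the SU(2) calculation above. Applying the same
expression to SU(3) with $n = 3$, $f = 6$ and taking $\alpha_s(q_0) \approx 0.118$ near 100 GeV are
assumptions of this example, since (4.31) is not reproduced here.

```python
import math

def beta_coefficient(n_colors, n_flavors):
    """f'(1) = (2 f - 11 n) / (12 pi), the form used in the text for SU(2)."""
    return (2.0 * n_flavors - 11.0 * n_colors) / (12.0 * math.pi)

def inverse_alpha(inv_alpha0, coeff, log_q2_ratio):
    """One-loop running: 1/alpha(q) = 1/alpha(q0) - f'(1) ln(q^2/q0^2)."""
    return inv_alpha0 - coeff * log_q2_ratio

Q0_GEV = 100.0
INV_ALPHA_W0 = 29.0           # from the text
INV_ALPHA_S0 = 1.0 / 0.118    # assumed strong coupling near 100 GeV

b2 = beta_coefficient(2, 6)   # SU(2)
b3 = beta_coefficient(3, 6)   # SU(3), same formula assumed
log_ratio = (INV_ALPHA_W0 - INV_ALPHA_S0) / (b2 - b3)
q_meet_gev = Q0_GEV * math.exp(0.5 * log_ratio)
print(b2, b3, q_meet_gev)
```

Under these assumptions the two inverse couplings cross at a scale of order $10^{17}$ GeV.
The result is exponentially sensitive to the input couplings, so it should be read as
an order-of-magnitude estimate only.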


Problem 4.29 Find $\alpha_1(q^2)$. (Hint: Do not forget hypercharges entering the vertices.)


With more accurate measurements of $\alpha_s$ and $\theta_w$, it has become clear that the
three constants do not quite meet at a point (Figure 4.19). This fact, along with
measurements of the proton lifetime and the discovery of neutrino masses, rules
out the minimal SU(5) model as a realistic theory. Nevertheless, we will take this
model a little further in order to explain some important features common to more
realistic theories.

Fig. 4.19.

The SU(5) group has $5^2 - 1 = 24$ generators and correspondingly there are
24 gauge bosons. Eight of them should be identified with the gluons responsible
for color transitions within the SU(3) subgroup. Three bosons correspond to
the SU(2) subgroup and together with one U(1) boson are responsible for the
electroweak interactions. The remaining 12 bosons form two charged colored
triplets,

$$X_i^{\pm 4/3}, \qquad Y_i^{\pm 1/3},$$

where $i = r, b, g$ is the color index and the upper indices correspond to the electric
charge. The X, Y bosons form a “bridge” between the SU(2) and SU(3) subgroups
of SU(5). After symmetry breaking (for instance, via the Higgs mechanism), they
acquire masses of order $\sim 10^{15}$-$10^{17}$ GeV and the transitions between SU(2) and
SU(3) subgroups are strongly suppressed. At high energies, however, the X, Y
bosons can “convert” quarks to leptons and vice versa very efficiently.

The X boson can decay either into a pair of quarks ($X \to qq$) or into an
antiquark-antilepton pair ($X \to \bar q\,\bar l$). The baryon numbers of the final products
are $B = 2/3$ and $B = -1/3$ respectively; hence baryon number is not conserved
in the SU(5) model. On the other hand, the difference between baryon and lepton
number is equal to 2/3 in both cases and $B - L$ is not violated. Hence $B - L$ cannot
be generated and any baryon asymmetry will be washed out in subsequent topological
transitions. Thus the baryon asymmetry problem cannot be solved within
the SU(5) model.
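
The quantum numbers of the two decay channels can be tallied explicitly. The sketch
below uses only the assignments stated above (baryon number 1/3 per quark, lepton
number 1 per lepton, opposite signs for antiparticles); the tuple layout and the
function name are illustrative.

```python
from fractions import Fraction

# (baryon number, lepton number) of the particles in the final states
QUARK = (Fraction(1, 3), 0)
ANTIQUARK = (Fraction(-1, 3), 0)
ANTILEPTON = (0, -1)

def totals(*particles):
    """Return total B, total L and B - L of a final state."""
    baryon = sum(p[0] for p in particles)
    lepton = sum(p[1] for p in particles)
    return baryon, lepton, baryon - lepton

print(totals(QUARK, QUARK))            # X -> q q
print(totals(ANTIQUARK, ANTILEPTON))   # X -> antiquark + antilepton
```

The two final states differ in $B$ but share the same $B - L = 2/3$, which is why the
decays violate baryon number while leaving $B - L$ intact.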

The larger symmetry groups look more attractive for several different reasons. 
First of all, they have a richer fermion content than the minimal SU(5) model. 
In particular, one can incorporate a right-handed neutrino, which is needed to ex- 
plain the neutrino masses. The other attractive feature of these theories is that 
they not only violate baryon number, but also do not conserve B — L. This opens 
the door to an understanding of the origin of the baryon asymmetry of the uni- 
verse. Finally, changing the fermion content influences the running of coupling 
constants, and thus one can hope that they will yet meet at one point. There is
extensive literature dealing with large gauge groups, for example, SO(10), SO(14),
SO(22), ..., $E_6$, $E_7$, $E_8$, .... The corresponding theories contain many particles
not yet discovered and hence many candidates for dark matter. However, in the
absence of solid data, we see no point in going into further detail of unified theories
here.


Supersymmetry The symmetries we have considered so far relate bosons to bosons
and fermions to fermions. However, there may be a beautiful symmetry, known as 
supersymmetry, which relates bosons and fermions. If supersymmetry is a true 
symmetry of nature, then every boson should have at least one fermionic super- 
partner, with which it is paired in the supermultiplet. Every fermion should also be 
partnered with at least one boson. Under a supersymmetry transformation, bosons 
and fermions in the same supermultiplet are “mixed” with each other. It is clear 
that the supersymmetry generator which converts a boson to a fermion should be a 
spinor Q, and in the simplest case it is a chiral spinor of spin 1/2. Because bosons 
and fermions transform differently under the Poincaré group, supersymmetry trans- 
formations, unlike gauge transformations, cannot be completely decoupled from 
spacetime transformations. In fact, the algebra of the supersymmetry generators Q 
closes only when the generators of the space and time translations are included. 
Hence, if we try to make supersymmetry local, we are forced to deal with curved 
spacetime. Local supersymmetry, called supergravity, thus offers a possible way to 
unify gravity with the other forces. 

Unfortunately, as in the case of Grand Unification, there are too many potential 
supersymmetric extensions of the Standard Model. First, the supersymmetry can be 
global or local. Second, we could include more than one boson—fermion pair in the 
same supermultiplet, an idea known as extended supersymmetry. In principle, all 
particles could be the members of a single multiplet. Extended supersymmetry is 
characterized by the number of supersymmetry generators $Q^1, Q^2, \ldots, Q^N$ which
determine the particle content of the supermultiplets. For instance, for N = 8 the 
supermultiplet contains both left- and right-handed particles of spins 0, 1/2, 1, 3/2 
and 2; thus N = 8 supergravity would be an ideal candidate for unification. Unfor- 
tunately (or fortunately, depending on one’s attitude) Nature does not act upon our 
wishes and in the absence of experimental data we must consider a diverse range 
of theoretical possibilities. 

All supersymmetric theories have features in common and we concentrate on 
those which are relevant for cosmological applications. As we have mentioned, in 
these theories bosons and fermions are paired. Disappointingly, all combine known 
fermions with unknown bosons and vice versa. Hence, supersymmetric theories 
predict that the number of particles should be at least twice as big as the number of 
experimentally discovered particles. To understand why the supersymmetric part- 
ners of the known particles have not yet been discovered, we are forced to assume 
that supersymmetry is broken above the scale currently reached by accelerators. 
It is only when supersymmetry is broken that supersymmetric partners can have 
different masses; otherwise they are obliged to have the same mass. 

In the minimal supersymmetric extension of the Standard Model, usually called 
MSSM, every quark and lepton has a supersymmetric scalar partner, called a squark
and a slepton respectively. Similarly, for every gauge particle we have a fermionic
superpartner with spin 1/2, called a gaugino. Among these, gluinos are superpart- 
ners of gluons, and winos and the bino are the superpartners of the gauge bosons of 
the electroweak group. The gauginos mediate the interaction of the scalar particles 
and their fermionic partners, with a strength determined by the gauge coupling 
constant. The Higgs particle is accompanied by a higgsino. The lightest neutral 
combination of -inos (mass eigenstate), called the neutralino, must be stable; if 
supersymmetry were broken at the electroweak scale, it would interact weakly with 
ordinary matter. Therefore, the neutralino is an ideal candidate for cold dark matter. 
To conclude our brief excursion to the “s- and -ino zoo,” we should mention the
gravitino, the spin 3/2 superpartner of the graviton, which could also serve as a
dark matter particle. Thus we see that supersymmetric theories provide us with 
the weakly interacting massive particles necessary to explain dark matter in the 
universe. 

Some remarkable properties of supersymmetric theories arise from the fact that 
the numbers of fermionic and bosonic degrees of freedom in these theories are equal. 
For instance, the superpartner of the left-handed fermion is a complex scalar field 
and they both have two degrees of freedom. The energy of the vacuum fluctuations 
per degree of freedom is the same in magnitude but opposite in sign for fermions and 
bosons of the same mass. Therefore, in supersymmetric theories the fermion and 
boson contributions cancel each other and the total vacuum energy density vanishes. 
In other words, the cosmological term is exactly zero. This is true, however, only if 
supersymmetry remains unbroken. But supersymmetry is broken and as a result the 
expected mismatch in vacuum energy densities, arising from the mass difference of 
the superpartners, is of order $\Lambda_{SUSY}^4$, where $\Lambda_{SUSY}$ is the supersymmetry breaking
scale. If $\Lambda_{SUSY} \sim 1$ TeV, the cosmological constant is about $\sim 10^{-64}$ in Planck
units. This number is still about 60 orders of magnitude larger than the observational
limit. Thus we see that supersymmetry, while a step in the right direction, does not 
quite solve the cosmological constant problem. 
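
The size of this mismatch is a one-line estimate, sketched below. The Planck mass of
$1.22\times 10^{19}$ GeV is an assumed input used to convert 1 TeV to Planck units; the
function name is illustrative.

```python
import math

M_PLANCK_GEV = 1.22e19    # Planck mass in GeV (assumed input)

def log10_vacuum_mismatch(susy_scale_gev):
    """log10 of (Lambda_SUSY / M_Planck)^4, the vacuum energy density in Planck units."""
    return 4.0 * math.log10(susy_scale_gev / M_PLANCK_GEV)

print(log10_vacuum_mismatch(1.0e3))   # Lambda_SUSY = 1 TeV
```

For a breaking scale of 1 TeV the exponent lies between $-65$ and $-64$, matching the
order of magnitude quoted above.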

The last remark we wish to make here concerns the behavior of the running 
coupling constants in the minimal supersymmetric extension of the Standard Model. 
The additional supersymmetric particles influence the rate of running of the strong 
and electroweak coupling constants. As a result, the three constants meet at a single 
point with impressive accuracy (Figure 4.20). This revives our hopes of Grand 
Unification and gives an indication that we may be on the right track. 


4.6.1 Dark matter candidates 


Nucleosynthesis and CMB data clearly indicate that most of the matter in the
universe is dark and nonbaryonic. There is no shortage of particle physics candidates
for this dark matter. These candidates can be classified according to whether the
dark matter particles originated via decoupling from a thermal bath or were created
in some nonthermal process.

Fig. 4.20.

In turn, thermal relics can be categorized further as to whether they were rel- 
ativistic or nonrelativistic at the moment of decoupling. The relics which were 
relativistic at this time constitute hot dark matter, while those which were nonrel- 
ativistic constitute cold dark matter. The neutrino and the neutralino are examples 
of hot and cold relics respectively. 

The simplest example of a nonthermal relic is the condensate of a weakly inter- 
acting massive scalar field. A well motivated particle physics candidate of this kind 
is the axion. Because the momentum of axions in the condensate is equal to zero, 
axions can also serve as a cold dark matter component even though their masses
are very small. Below we describe some generic features characterizing different 
dark matter candidates. 


Hot relics The freeze-out of hot relics occurs when they are still relativistic. After
relics decouple from matter their number density $n_\psi$ decreases in inverse proportion
to the volume, that is, as $a^{-3}$. The total entropy density of the remaining matter, $s$,
scales in the same way and hence the ratio $n_\psi/s$ has remained constant until the
present time.


Problem 4.30 Using this fact and assuming that the hot relics are fermions of
mass $m_\psi$ with negligible chemical potential, verify that their contribution to the
cosmological parameter is

$$\Omega_\psi h_{75}^2 \simeq \frac{g_\psi}{g_*}\left(\frac{m_\psi}{19\ \mathrm{eV}}\right). \tag{4.200}$$

Here $g_\psi$ is the total number of degrees of freedom of the hot relics and $g_* =
g_b + (7/8)\,g_f$ is the effective number of bosonic and fermionic degrees of freedom
at freeze-out for all particles which later convert their energy into photons.

Taking three left-handed light neutrino species of the same mass $m_\nu$, we have
$g_\psi = 6$ (3 for neutrinos + 3 for antineutrinos). The neutrinos decouple at a
temperature of $\sim O(1)$ MeV. At this time the only particles which contribute to $g_*$ are the
electrons, positrons and photons, and $g_* = 2 + (7/8)\times 4 = 5.5$. Therefore, if the
mass of every neutrino were about 17 eV, then neutrinos would close the universe.
According to observations, the contribution of dark matter to the total density
does not exceed 30% (see, in particular, Chapter 9). Hence the sum of the neutrino
masses should be smaller than 15 eV or, in the case of equal masses, $m_\nu < 5$ eV.
The mass bound on hot relic species changes if we assume that they freeze out at
a higher temperature when more relativistic particles are present. For instance, if
decoupling happened before the quark-gluon phase transition, at $T \sim 300$ MeV,
then $g_* = 53$ (for the number of degrees of freedom at this time, see (4.33) and do
not forget to include photons and electron-positron pairs). In this case, it follows
from (4.200) that $m_\psi < 151$ eV for $g_\psi = 2$. In reality the bound on the masses
is even stronger because hot relics cannot explain all dark matter in the universe; 
they can constitute only a subdominant fraction of it. In fact, in models where hot 
dark matter dominates, inhomogeneities are washed out by free streaming on all 
scales up to the horizon scale at the moment when the relics became nonrelativistic 
(see Section 9.2). As a result the large scale structure of the universe cannot be 
explained. Therefore, more promising and successful models are those in which 
cold relics constitute the dominant part of the dark matter. 
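
The numbers quoted above follow directly from (4.200). As a minimal sketch (the function names are ours and purely illustrative), the bounds can be reproduced as follows:

```python
# Sketch of eq. (4.200): Omega_psi * h75^2 ~ (g_psi / g_star) * (m_psi / 19 eV).
# Function names are illustrative, not taken from the text.

def omega_hot(m_ev, g_psi, g_star):
    """Hot-relic contribution to the cosmological parameter, eq. (4.200)."""
    return (g_psi / g_star) * (m_ev / 19.0)

def mass_for_omega(omega, g_psi, g_star):
    """Invert (4.200): relic mass in eV that gives the requested Omega."""
    return 19.0 * omega * g_star / g_psi

# Three light neutrino species decoupling at T ~ 1 MeV.
g_star_nu = 2 + (7 / 8) * 4                     # photons + electron-positron pairs = 5.5
m_close = mass_for_omega(1.0, 6, g_star_nu)     # mass per species that closes the universe
m_bound = mass_for_omega(0.3, 6, g_star_nu)     # mass per species if Omega < 0.3

# Relic with g_psi = 2 decoupling at T ~ 300 MeV, where g_star = 53.
m_early = mass_for_omega(0.3, 2, 53)

print(round(m_close, 1), round(m_bound, 1), round(m_early, 1))
```

With these inputs the three masses come out at about 17.4 eV, 5.2 eV and 151 eV, in line with the estimates in the text.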


Thermal cold relics Cold relics $\chi$ decouple at temperatures $T_*$ much less than
their mass $m_\chi$. Therefore, the number density $n_\chi$ is exponentially suppressed in
comparison with the number density of photons. To deduce $n_\chi^*$ at the moment
of decoupling, we simply equate the annihilation rate of relics $\chi$ to the Hubble
expansion rate:

$$n_\chi^* \langle\sigma v\rangle_* \simeq H_* = (8\pi^3/90)^{1/2}\, g_*^{1/2}\, T_*^2, \qquad (4.201)$$

where $\langle\sigma v\rangle_*$ is the thermally averaged product of the annihilation cross-section $\sigma$
and the relative velocity $v$. The effective number of degrees of freedom $g_*$ accounts
for all particles which are relativistic at $T_*$. Alternatively, we know from (3.61),
setting $\mu = 0$, that

$$n_\chi^* \simeq g_\chi \frac{T_*^3}{(2\pi)^{3/2}}\, x_*^{3/2} e^{-x_*}, \qquad (4.202)$$

where $x_* \equiv m_\chi/T_*$. Then, using the estimate $\langle\sigma v\rangle_* \simeq \sigma_* \sqrt{T_*/m_\chi}$, where $\sigma_*$ is the
effective cross-section at $T = T_*$, we find that at freeze-out

$$x_* \simeq \ln\left(0.038\, g_\chi g_*^{-1/2} \sigma_* m_\chi\right) \simeq 16.3 + \ln\left[g_\chi g_*^{-1/2}\left(\frac{\sigma_*}{10^{-38}\ \text{cm}^2}\right)\left(\frac{m_\chi}{\text{GeV}}\right)\right]. \qquad (4.203)$$

The relics are cold only if $x_* > O(1)$; to be definite we take $x_* > 3$. It follows
from (4.203) that $\sigma_* m_\chi > 10^{3}\, g_*^{1/2} g_\chi^{-1}$ (in Planck units) for cold relics. Their energy
density today is





$$\varepsilon_\chi^0 \simeq m_\chi n_\chi^* \frac{s_0}{s_*} = \frac{2}{\tilde g_*}\, m_\chi n_\chi^* \left(\frac{T_{\gamma 0}}{T_*}\right)^3, \qquad (4.204)$$

where $T_{\gamma 0} \simeq 2.73$ K and $s_0/s_*$ is the ratio of the present entropy density of radiation
to the total entropy density at freeze-out of those components of matter (with
$\tilde g_*$ effective degrees of freedom) which later transfer their entropy to radiation.
Substituting $n_\chi^*$ from (4.201) into (4.204) we finally obtain

$$\Omega_\chi h_{75}^2 \simeq \frac{g_*^{1/2}}{\tilde g_*}\, x_*^{3/2} \left(\frac{3 \times 10^{-38}\ \text{cm}^2}{\sigma_*}\right). \qquad (4.205)$$

Remarkably, the contribution of cold relics to the cosmological parameter depends
only logarithmically on their mass (through $x_*$) and is mainly determined by the
effective cross-section $\sigma_*$ at decoupling.
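
The weak mass dependence is easy to check numerically. The sketch below evaluates (4.203) and (4.205); the Planck length and Planck mass used for the unit conversion, the function names and the sample values $g_\chi = 2$, $g_* = \tilde g_* = 100$ are our assumptions, chosen only for illustration.

```python
import math

L_PLANCK_CM = 1.616e-33    # Planck length in cm (assumed value for unit conversion)
M_PLANCK_GEV = 1.22e19     # Planck mass in GeV (assumed value for unit conversion)

def x_freeze(sigma_cm2, m_gev, g_chi, g_star):
    """x_* = m_chi / T_* from eq. (4.203); inputs are converted to Planck units."""
    sigma_planck = sigma_cm2 / L_PLANCK_CM**2
    m_planck = m_gev / M_PLANCK_GEV
    return math.log(0.038 * g_chi * sigma_planck * m_planck / math.sqrt(g_star))

def omega_cold(sigma_cm2, m_gev, g_chi, g_star, g_star_entropy):
    """Omega_chi * h75^2 from eq. (4.205), with x_* taken from eq. (4.203)."""
    x_star = x_freeze(sigma_cm2, m_gev, g_chi, g_star)
    return math.sqrt(g_star) / g_star_entropy * x_star**1.5 * (3e-38 / sigma_cm2)

# The constant in (4.203): sigma_* = 1e-38 cm^2, m_chi = 1 GeV, g_chi = g_* = 1.
x_reference = x_freeze(1e-38, 1.0, 1, 1)

# Hypothetical relic with g_chi = 2 and g_* = g~_* = 100 (illustrative values only):
# the mass is raised by a factor of 100 at fixed cross-section.
ratio = omega_cold(1e-38, 1000.0, 2, 100, 100) / omega_cold(1e-38, 10.0, 2, 100, 100)

print(round(x_reference, 1), round(ratio, 2))
```

Under these assumptions the first number reproduces the constant 16.3 of (4.203), and a hundredfold change in mass changes the relic density by a factor of only about 1.4, whereas the dependence on $\sigma_*$ is inversely proportional up to the same logarithm.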

Weakly interacting massive particles, which have masses between 10 GeV and
a few TeV and cross-sections of approximately electroweak strength $\sigma_{EW} \sim 10^{-38}$
cm$^2$, are ideal candidates for cold dark matter. Their number density freezes out
when $x_* \sim 20$ and, as is clear from (4.205), they may easily contribute the
necessary 30% to the total density of the universe and thus constitute the dominant
component of dark matter. Currently, the leading weakly interacting massive particle
candidate is the lightest supersymmetric particle. Most supersymmetric theories
have a discrete symmetry called R-parity, under which particles have eigenvalue $+1$
and superparticles $-1$. R-parity conservation guarantees the stability of the lightest
supersymmetric particle. The lightest supersymmetric particle is most likely a
neutralino, which could be (mostly) a bino or a photino.




Problem 4.31 Consider the annihilation reaction and its inverse: $\chi\bar\chi \rightleftharpoons f\bar f$.
Assuming that the annihilation products $f$ and $\bar f$ always have thermal distribution
with zero chemical potential (because of some “stronger” interactions with other
particles), derive the following equation:

$$\frac{dX}{ds} = \frac{\langle\sigma v\rangle}{3H}\left(X^2 - X_{eq}^2\right), \qquad (4.206)$$

where $X \equiv n_\chi/s$ is the actual number of $\chi$ particles per comoving volume and $X_{eq}$
is the equilibrium value. Total entropy is conserved and therefore the entropy density
$s$ scales as $a^{-3}$. Find the approximate solutions of (4.206) for hot and cold relics and
compare them with the numerical solutions. Determine the corresponding freeze-out
number densities and compare the results with the estimates made in this section.
(Hint In equilibrium the direct and inverse reactions balance each other exactly, so
that $\langle\sigma v\rangle_{\chi\bar\chi}\, n_\chi^{eq} n_{\bar\chi}^{eq} = \langle\sigma v\rangle_{f\bar f}\, n_f^{eq} n_{\bar f}^{eq}$.)
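
For the numerical part of this problem, one possible starting point is sketched below for cold relics. It is not the solution intended by the text. We assume a constant $\langle\sigma v\rangle$ and a constant number of degrees of freedom, so that with $x = m_\chi/T$, $s \propto x^{-3}$ and $H \propto x^{-2}$ equation (4.206) takes the form $dX/dx = -(\lambda/x^2)(X^2 - X_{eq}^2)$ with $X_{eq} = c\,x^{3/2}e^{-x}$. The constants $\lambda$ and $c$, the function names and the parameter values are hypothetical.

```python
import math

def x_eq(x, c):
    """Equilibrium abundance of a cold species: X_eq = c * x^{3/2} * exp(-x)."""
    return c * x**1.5 * math.exp(-x)

def integrate_abundance(lam, c, x_start=5.0, x_end=300.0, h=0.01):
    """Backward-Euler solution of dX/dx = -(lam / x^2) * (X^2 - X_eq^2)."""
    n_steps = int(round((x_end - x_start) / h))
    abundance = x_eq(x_start, c)                 # start on the equilibrium curve
    for i in range(1, n_steps + 1):
        x = x_start + i * h
        a = h * lam / x**2
        eq_value = x_eq(x, c)
        rhs = abundance + a * eq_value * eq_value
        # Implicit step: positive root of a*X^2 + X - rhs = 0, stable for large lam.
        abundance = 2.0 * rhs / (1.0 + math.sqrt(1.0 + 4.0 * a * rhs))
    return abundance

def freeze_out_estimate(lam, c, iterations=50):
    """Rate = Hubble criterion, lam * X_eq(x_f) / x_f = 1, solved by fixed-point iteration."""
    x_f = 20.0
    for _ in range(iterations):
        x_f = math.log(c * lam * math.sqrt(x_f))
    return x_f, x_f / lam

LAM, C = 1e10, 0.1                               # hypothetical illustrative parameters
x_f, x_estimate = freeze_out_estimate(LAM, C)
x_numeric = integrate_abundance(LAM, C)
print(x_f, x_estimate, x_numeric)
```

The implicit step is used because the equation is stiff while $X$ still tracks $X_{eq}$. The criterion in `freeze_out_estimate` is the same "annihilation rate equals Hubble rate" condition as in (4.201), so comparing `x_estimate` with `x_numeric` indicates how good that estimate is for the chosen parameters.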


Nonthermal relics In general, for nonthermal relics, interactions are so weak that
they are never in equilibrium. There is no general formula which describes the
contribution of these relics to the total energy density because this contribution
depends on the concrete dynamics. As an example of nonthermal relics we consider the
homogeneous condensate of massive scalar particles and neglect their interactions
with other fields. The homogeneous scalar field satisfies the equation

$$\ddot\varphi + 3H\dot\varphi + m^2\varphi = 0. \qquad (4.207)$$

To solve it for generic $H(t)$, it is convenient to use the conformal time
$\eta \equiv \int dt/a$ as a temporal variable. Introducing the rescaled field $\varphi \equiv u/a$ we find that
in terms of $u$, (4.207) becomes

$$u'' + \left(m^2 a^2 - \frac{a''}{a}\right) u = 0, \qquad (4.208)$$

where a prime denotes the derivative with respect to $\eta$. If $|a''/a| \sim a^2H^2 \gg m^2a^2$,
the first term in the brackets can be neglected and the corresponding approximate
solution of this equation is

$$u \simeq a\left(C_1 + C_2 \int \frac{d\eta}{a^2}\right), \qquad (4.209)$$

where $C_1$ and $C_2$ are integration constants. The dominant mode given by the first
term yields $\varphi \simeq C_1 \simeq \text{const}$. Therefore, while the mass $m$ is much smaller than the
Hubble constant $H$, the scalar field is frozen and its energy density

$$\varepsilon_\varphi = \tfrac{1}{2}\left(\dot\varphi^2 + m^2\varphi^2\right) \qquad (4.210)$$


remains constant if $m = \text{const}$, and resembles the cosmological constant.

As the Hubble constant becomes smaller than the mass, and eventually $H^2 \ll m^2$,
we can neglect the second term inside the brackets in (4.208). The WKB
solution of the simplified equation is then

$$u \propto (ma)^{-1/2} \sin\left(\int ma\, d\eta\right), \qquad (4.211)$$

and correspondingly

$$\varphi \propto m^{-1/2} a^{-3/2} \sin\left(\int m\, dt\right). \qquad (4.212)$$

Note that this solution is also valid for a slowly varying mass $m$. Substituting this
into (4.210) we find, in the leading order, that the energy density of the scalar field
decreases as $ma^{-3}$; hence it behaves as dust-like matter ($p \approx 0$). This is easy to
understand: after the value of the Hubble constant drops below that of the mass,
the scalar field, which was frozen before, starts to oscillate and can be interpreted
as a Bose condensate of many cold particles of mass $m$ with zero momentum. For
a slowly varying mass the particle number density, which is proportional to $\varepsilon_\varphi/m$,
decays as $a^{-3}$ and the total particle number is conserved.
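
Both regimes can be seen in a direct numerical integration of (4.207). The sketch below assumes a radiation-dominated background, $H = 1/(2t)$ and $a \propto t^{1/2}$, a constant mass and units with $m = 1$; the starting time, step size and function names are our choices and are not taken from the text.

```python
def rhs(t, phi, dphi, m):
    """phi'' = -3 H phi' - m^2 phi with H = 1/(2t) (radiation domination, assumed)."""
    return dphi, -1.5 / t * dphi - m * m * phi

def evolve(m=1.0, t0=0.05, t_end=200.0, dt=0.005, sample_times=(0.3, 50.0, 200.0)):
    """RK4 integration of (4.207); returns {t_sample: (phi, energy_density * a^3)}."""
    phi, dphi = 1.0, 0.0                      # field initially frozen at phi_in = 1
    n_steps = int(round((t_end - t0) / dt))
    pending = sorted(sample_times)
    out = {}
    for i in range(n_steps):
        t = t0 + i * dt
        k1 = rhs(t, phi, dphi, m)
        k2 = rhs(t + dt / 2, phi + dt / 2 * k1[0], dphi + dt / 2 * k1[1], m)
        k3 = rhs(t + dt / 2, phi + dt / 2 * k2[0], dphi + dt / 2 * k2[1], m)
        k4 = rhs(t + dt, phi + dt * k3[0], dphi + dt * k3[1], m)
        phi += dt / 6 * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0])
        dphi += dt / 6 * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1])
        t_new = t0 + (i + 1) * dt
        while pending and t_new >= pending[0] - 1e-9:
            energy = 0.5 * (dphi**2 + m * m * phi**2)          # eq. (4.210)
            out[pending.pop(0)] = (phi, energy * t_new**1.5)   # a^3 scales as t^{3/2}
    return out

result = evolve()
phi_early = result[0.3][0]       # m*t = 0.3, i.e. H > m: field still close to phi_in
comoving_mid = result[50.0][1]   # H << m: energy density times a^3
comoving_late = result[200.0][1]
print(phi_early, comoving_late / comoving_mid)
```

While $H > m$ the field stays close to its initial value, and once $H \ll m$ the product $\varepsilon_\varphi a^3$ settles to a constant up to small oscillating corrections of order $H/m$, which is the dust-like behavior described above.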

Using these results, we can easily calculate the current energy density of the
scalar field (in Planck units):

$$\varepsilon_\varphi^0 \simeq O(1)\, \frac{g_*^{3/4}}{\tilde g_*}\, \frac{m_0}{m_*^{1/2}}\, \varphi_{in}^2\, T_{\gamma 0}^3, \qquad (4.213)$$

where $m_0$ is its mass at present, $m_*$ is the mass at the moment when $H_* \simeq m_*$,
$\varphi_{in}$ is the initial value of the scalar field when it was still frozen, and $g_*$ and
$\tilde g_*$ are the effective numbers of degrees of freedom entering (4.201) and (4.204), taken
at that moment. For the case of constant mass ($m_0 = m_*$), the contribution of this
field to the total energy density is

$$\Omega_\varphi h_{75}^2 \sim \frac{g_*^{3/4}}{\tilde g_*} \left(\frac{m}{100\ \text{GeV}}\right)^{1/2} \left(\frac{\varphi_{in}}{3 \times 10^{9}\ \text{GeV}}\right)^2. \qquad (4.214)$$


Thus, tuning two parameters, the mass and the initial value of the scalar field, 
we have a straightforward “explanation” for the observed cold dark matter in the 
universe. 


Axions As previously mentioned, axions are an attractive nonthermal relic candidate.
The axion field is introduced to solve the strong CP problem. Because the
strong coupling constant is large at low energies, topological transitions are not
suppressed in quantum chromodynamics. Therefore, one expects that the true quantum
chromodynamics vacuum is the $\theta$ vacuum, which is a superposition of vacua with
different winding numbers $n$:

$$|\theta\rangle = \sum_n e^{-in\theta} |n\rangle, \qquad (4.215)$$

where $\theta$ is an arbitrary parameter which must be determined experimentally. As a
result, the effective Lagrangian possesses an additional nonperturbative term


$$\theta\, \frac{\alpha_s}{8\pi}\, \mathrm{tr}(F\tilde F), \qquad (4.216)$$

where $F$ and $\tilde F$ are the gluon field strength and its dual respectively. This term,
being a total derivative, does not affect the equations of motion and conserves C. It
violates CP, P and T, however, and produces a very large neutron dipole moment
which contradicts experimental bounds unless $\theta < 10^{-10}$. One either has to accept
the smallness of $\theta$ as fact or try to find a natural explanation for why this parameter
is so small by introducing a new symmetry. The elegant known solution of the
strong CP problem, suggested by Peccei and Quinn, involves an additional global
chiral $U(1)_{PQ}$ symmetry imposed on the Standard Model. This symmetry, broken
at a scale $f$, essentially serves to replace the $\theta$ parameter by a dynamical field,
the axion field. In many axion models, a new complex scalar field $\varphi = \chi \exp(i\bar\theta)$
is used to generate a $U(1)_{PQ}$-invariant mass term for some colored fermions, via
Yukawa coupling. After symmetry breaking, the field $\chi$ acquires the expectation
value $f$. In the case of local symmetry the field $\bar\theta$ would be “eaten” by the gauge
field, but when the symmetry is global it becomes a massless degree of freedom
and this is called the axion $a$. To be precise, $a \equiv f\bar\theta$. At the quantum level the
chiral $U(1)_{PQ}$ symmetry suffers from the chiral anomaly and as a result there is an
effective interaction of the axion field with gluons:


$$\frac{a}{f}\, \frac{\alpha_s}{8\pi}\, \mathrm{tr}(F\tilde F), \qquad (4.217)$$

which shares the same structure as (4.216). The terms of this type generate an
effective potential for $\theta + a/f$ with the minimum at $a = -f\theta$ and the overall CP
violating term vanishes in the minimum of the potential. What is most important
for us is that in the vicinity of the potential minimum the axion acquires a small
mass of order

$$m_a \simeq \frac{(m_u m_d)^{1/2}}{m_u + m_d}\, \frac{m_\pi f_\pi}{f} \simeq \frac{6 \times 10^{6}}{f_{GeV}}\ \text{eV}, \qquad (4.218)$$

where $m_u$ and $m_d$ are the masses of the light quarks, $f_{GeV}$ is $f$ in units of GeV,
$m_\pi \simeq 130$ MeV is the pion mass and $f_\pi \simeq 93$ MeV is the pion decay constant. The
axion mass arises from quantum chromodynamics instanton effects and these are
altered at finite temperatures. In particular, at $T \gg \Lambda_{QCD} \simeq 200$ MeV, we have

$$m_a(T) \simeq m_a \left(\Lambda_{QCD}/T\right)^4. \qquad (4.219)$$


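For realistic values of $f$, the axion mass becomes of order the Hubble constant at
these high temperatures. Using $H_*$ in (4.201), we find $m_a(T_*) \simeq H_*$ when

$$T_* \sim g_*^{-1/12}\, m_a^{1/6}\, \Lambda_{QCD}^{2/3}$$

and

$$m_* \sim g_*^{1/3}\, m_a^{1/3}\, \Lambda_{QCD}^{4/3}. \qquad (4.220)$$

Substituting this value for $m_*$ into (4.213) and taking into account that $a_{in} = f\bar\theta_{in}$,
we finally obtain the following expression for the axion contribution to the total
energy density:

$$\Omega_a h_{75}^2 \sim O(1) \left(\frac{6 \times 10^{-6}\ \text{eV}}{m_a}\right)^{7/6} \bar\theta_{in}^2 \sim O(1) \left(\frac{f}{10^{12}\ \text{GeV}}\right)^{7/6} \bar\theta_{in}^2. \qquad (4.221)$$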


The axion is periodic in $\bar\theta = a/f$ and therefore the natural value for $\bar\theta_{in}$ is $\sim O(1)$.
Thus, if $m_a \sim 10^{-5}$ eV, the axions can constitute the dominant component of dark
matter. Because all couplings of the axion scale as $1/f$, axions interact only very
weakly with ordinary matter and, in spite of their small mass, are cold particles. At
first glance, only axions with masses within a very narrow window near $10^{-5}$ eV
seem to be cosmologically interesting. However, one can argue that $\bar\theta_{in} \ll 1$ is not
so unnatural in inflationary cosmology. This allows the universe to close even with
axions of much smaller mass.
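
To get a feeling for the numbers, (4.218) and (4.221) can be evaluated directly. In the sketch below the light-quark masses are assumed values that do not appear in the text, the $O(1)$ prefactor of (4.221) is simply set to unity, and the function names are illustrative.

```python
import math

M_PI_MEV, F_PI_MEV = 130.0, 93.0     # pion mass and decay constant quoted in the text
M_U_MEV, M_D_MEV = 2.2, 4.7          # assumed light-quark masses (not given in the text)

def axion_mass_ev(f_gev):
    """Zero-temperature axion mass in eV from the first expression in eq. (4.218)."""
    quark_factor = math.sqrt(M_U_MEV * M_D_MEV) / (M_U_MEV + M_D_MEV)
    mass_mev = quark_factor * M_PI_MEV * F_PI_MEV / (f_gev * 1e3)   # f converted to MeV
    return mass_mev * 1e6

def omega_axion(m_a_ev, theta_in=1.0):
    """Eq. (4.221) with the O(1) prefactor set to 1 (an assumption)."""
    return (6e-6 / m_a_ev) ** (7.0 / 6.0) * theta_in**2

m_a = axion_mass_ev(1e12)            # symmetry breaking scale f = 10^12 GeV
print(m_a, omega_axion(1e-5), omega_axion(1e-7, theta_in=0.1))
```

For $f = 10^{12}$ GeV the mass comes out close to $6 \times 10^{-6}$ eV, consistent with the second expression in (4.218), and the last call illustrates how a small initial angle compensates for a much smaller axion mass.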

Particle theories beyond the Standard Model also provide us with other candi- 
dates for dark matter, among which are the gravitino, the axino (the superpartner of 
the axion), and the remnant of the inflaton. Their contributions can be determined 
using approaches similar to those outlined above. 


4.6.2 Baryogenesis 


The universe is asymmetric: there are more baryons than antibaryons. While
antibaryons are produced in accelerators or in cosmic rays, “antigalaxies” are not
observed. The relative excess of the baryons $B \equiv (n_b - n_{\bar b})/s \sim 10^{-10}$ is exactly
what we need to explain the abundance of light elements and the observations of
CMB fluctuations (see Chapter 9). In the past the baryon asymmetry could be
interpreted as simply due to the initial conditions in the universe. However, in the light
of inflationary cosmology, now widely accepted, this “solution” completely fails. We
will see in the following chapter that an inflationary stage erases any pre-existing
asymmetry and the possibility of its dynamical generation becomes an inevitable
element of inflationary cosmology. Theories beyond the Standard Model provide us
with many (perhaps too many) potential solutions to this problem. Fortunately,
any particular model for baryogenesis should possess three basic ingredients which
are independent of the details of the actual theory. These ingredients, formulated
by A. Sakharov in 1967, are


(i) baryon number violation, 
(ii) C and CP violation, 
(iii) departure from equilibrium. 


The first condition is obvious and does not require a long explanation. If baryon 
number is conserved and is equal to zero at the beginning, it will remain zero forever. 
If baryon number does not satisfy any conservation law, it vanishes in the state of 
thermal equilibrium. Therefore we need the third condition. The second condition 
is less trivial: it is a prerequisite for ensuring a different reaction (decay) rate for 
particles and antiparticles. If this condition is not met, the numbers of baryons and 
antibaryons produced are equal and no net baryon charge is generated even if the 
other two conditions are fulfilled. 


Problem 4.32 Why must we require both C and CP violation? Why is CP violation
alone not enough?


The Standard Model possesses all the ingredients necessary for the generation 
of baryon asymmetry. In fact, we have seen that baryon number is not conserved 
in topological transitions, CP is violated in weak interactions and the departure
from thermal equilibrium naturally occurs in the expanding universe. It would be 
remarkable if the baryon number could be explained within the Standard Model 
itself. Unfortunately this seems not to work. The main obstacle is the third condition. 
For realistic values of the Higgs mass, the electroweak transition is a cross-over and 
cannot supply us with the necessary strong deviations from thermal equilibrium. 
Therefore, to explain baryon asymmetry we have to go beyond the Standard Model. 
There is a wide range of possibilities; below we outline the baryogenesis scenarios 
most commonly considered. 


Baryogenesis in Grand Unified Theories Baryon number is generically not conserved
in Grand Unified Theories. In the SU(5) theory, for example, the heavy
gauge boson $X$, responsible for “communication” between the quark and lepton
sectors, can decay into either a $qq$ pair or a $\bar q \bar l$ pair with baryon numbers 2/3 or
$-1/3$ respectively. The antiboson $\bar X$ decays into a $\bar q \bar q$ or $ql$ pair. The CPT invariance
requires the equality of the total decay rates for $X$ and $\bar X$. However, this does not
mean that the decay rates are equal for every particular channel. If C and CP are
violated, the rate $\Gamma(X \to qq) \equiv r$ is generally not equal to $\Gamma(\bar X \to \bar q \bar q) \equiv \bar r$. In
fact, let us assume that the only source for CP violation is the Kobayashi-Maskawa
mechanism. Then the coupling constants can be complex: they enter the diagrams
describing the decay of particles, but their conjugated values enter the diagrams
for antiparticle decay. The difference in the decay rates can be seen in the
interference of tree-level diagrams with higher-order diagrams such as in Figure 4.21.
This difference, characterizing the degree of CP violation, is proportional to the
imaginary part of the corresponding product of coupling constants and vanishes if
all constants are real-valued (CP is not violated).

Fig. 4.21.

One expects that at temperatures much higher than $m_X$, the $X$ and $\bar X$ bosons are in
equilibrium and their abundances are the same, $n_X = n_{\bar X} \sim n_\gamma$. As the temperature
drops to about $m_X$, the processes responsible for maintaining equilibrium become
inefficient and the number density per comoving volume freezes out at some value
$\gamma_* = n_X/s$ (see Section 4.6.1). Subsequently, only the out-of-equilibrium decay of
the $X$ and $\bar X$ bosons is important and net baryon charge can be produced. Let us
estimate its size. We normalize the overall decay rate, which is the same for particles
and antiparticles, to unity. Then $\Gamma(X \to \bar q \bar l) = 1 - r$ and $\Gamma(\bar X \to ql) = 1 - \bar r$.
The mean net baryon number produced in the decay of the $X$ boson is

$$B_X = (2/3)\, r + (-1/3)(1 - r);$$

likewise

$$B_{\bar X} = (-2/3)\, \bar r + (1/3)(1 - \bar r).$$

Hence the resulting baryon asymmetry is

$$B = \gamma_* (B_X + B_{\bar X}) = \gamma_* (r - \bar r). \qquad (4.222)$$
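
The algebra leading to (4.222) can be checked with exact rational arithmetic. In the sketch below the branching ratios are hypothetical numbers chosen only to exercise the formula, and the function name is ours.

```python
from fractions import Fraction

def net_baryon_number(r, r_bar):
    """Mean baryon number per X decay, per anti-X decay, and their sum."""
    b_x = Fraction(2, 3) * r + Fraction(-1, 3) * (1 - r)
    b_xbar = Fraction(-2, 3) * r_bar + Fraction(1, 3) * (1 - r_bar)
    return b_x, b_xbar, b_x + b_xbar

r, r_bar = Fraction(51, 100), Fraction(49, 100)   # hypothetical branching ratios
b_x, b_xbar, total = net_baryon_number(r, r_bar)
print(b_x, b_xbar, total)
```

The sum is exactly $r - \bar r$ for any choice of the two branching ratios, and it vanishes when $r = \bar r$, that is, when C and CP are conserved.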


We see that $B$ depends on the freeze-out concentration $\gamma_*$ and on a parameter
$\varepsilon \equiv (r - \bar r)$ which characterizes the amount of CP violation. The term $\gamma_*$ is mainly
determined by the rates of reactions responsible for equilibrium (see Section 4.6.1)
and does not exceed unity. The parameter $\varepsilon$ comes from higher-order perturbation
theory and therefore $B \ll 1$. For example, in the minimal SU(5) model the
parameter $\varepsilon$ receives its first nontrivial contribution at the tenth order of perturbation
theory and the resulting baryon asymmetry is many orders of magnitude smaller
than the required $10^{-10}$. Another unhappy feature of the SU(5) model is that $B - L$
is conserved in this theory. In this case $B - L = 0$ even if $B \neq 0$ and any baryon
asymmetry generated will be washed out in subsequent topological transitions. For
successful baryogenesis we actually need to generate nonvanishing $B - L$. In more
complex models both these obstacles can, in principle, be overcome. For example,
in the SO(10) model, neither $B$ nor $B - L$ is conserved, and the necessary $\varepsilon$ can
be arranged.

In reality the situation with Grand Unified Theory baryogenesis is more com- 
plicated than described above. We will see later that inflation most likely ends at 
energy scales below the Grand Unified Theory scale and hence that relativistic X 
bosons were never in thermal equilibrium. However, one can produce them (out of 
equilibrium) during the preheating phase (see Section 5.5). 


Baryogenesis via leptogenesis Baryon asymmetry can also be generated via
leptogenesis. What is required is a nonvanishing initial value of $(B - L)_i$. Even if
$B_i = 0$ and $L_i \neq 0$, lepton number will be partially converted to baryon number
in subsequent topological transitions. Since $B + aL$ vanishes in these transitions
while $B - L$ is conserved, the final baryon number is

$$B_f = -\frac{a}{1 + a}\, L_i, \qquad (4.223)$$

where $a = 28/51$ in the Standard Model with one Higgs doublet. In turn, the initial
nonvanishing lepton number $L_i$ can be generated in out-of-equilibrium decay of
heavy neutrinos.
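
Equation (4.223) is the solution of two linear conditions, and the conversion factor for $a = 28/51$ can be obtained exactly. The following sketch (function name ours) assumes $B_i = 0$ as in the text.

```python
from fractions import Fraction

def final_baryon_number(l_initial, a=Fraction(28, 51)):
    """Solve B_f + a*L_f = 0 together with B_f - L_f = (B - L)_i, assuming B_i = 0."""
    b_minus_l = -l_initial               # conserved in the topological transitions
    l_final = -b_minus_l / (1 + a)
    b_final = -a * l_final
    return b_final, l_final

b_final, l_final = final_baryon_number(Fraction(1))
print(b_final, l_final)
```

For one unit of initial lepton number this gives $B_f = -28/79$, so roughly a third of the initial lepton asymmetry ends up as baryon asymmetry of the opposite sign.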

Let us briefly discuss the motivation for the existence of such heavy neutrinos. 
The neutrino oscillations measured can be explained only if the neutrinos have non- 
vanishing masses. To generate the neutrino masses we need right-handed neutrinos 
$\nu_R$. Then the Yukawa coupling term generating the Dirac masses can be written as
in (4.82):

$$\mathcal{L}_Y^{\nu} = -f^{\nu}_{ij}\, \chi\, \bar{\mathbf{L}}_L^{i}\, \boldsymbol{\zeta}_1\, \nu_R^{j} + \text{h.c.} = -f^{\nu}_{ij}\, \chi\, \bar\nu_L^{i}\, \nu_R^{j} + \text{h.c.}, \qquad (4.224)$$

where $i = 1, 2, 3$ is the lepton generation index and $\nu_L^i$ are the SU(2) gauge-invariant
left-handed neutrinos defined in Section 4.3.4. Under the U(1) group,
$\nu_L$ transforms according to (4.72) and, because $Y_L = -1/2$, it remains invariant;
hence the Yukawa term (4.224) is gauge-invariant only if the hypercharges of the
right-handed neutrinos are equal to zero. The right-handed neutrinos are SU(2)
singlets, have no color and do not carry hypercharge. Therefore a Majorana mass
term,


$$\mathcal{L}_M = -\tfrac{1}{2} M_{ij}\, (\bar\nu_R^{c})^{i}\, \nu_R^{j} + \text{h.c.}, \qquad (4.225)$$

is consistent with the gauge symmetries of the theory; here the $\nu_R^c$ are the
charge-conjugated wave functions of the right-handed neutrinos. After symmetry breaking
the field $\chi$ acquires the expectation value $\chi_0$ and the Yukawa term induces the Dirac
masses described by the mass matrix $(m_D)_{ij} = f^{\nu}_{ij} \chi_0$. Considering for simplicity
the case of one generation, we can write the total mass term as

$$\mathcal{L}^{\nu} = -\frac{1}{2} \begin{pmatrix} \bar\nu_L & \bar\nu_R^c \end{pmatrix} \begin{pmatrix} 0 & m_D \\ m_D & M \end{pmatrix} \begin{pmatrix} \nu_L^c \\ \nu_R \end{pmatrix} + \text{h.c.}, \qquad (4.226)$$

taking into account that $\bar\nu_L \nu_R = \bar\nu_R^c \nu_L^c$. The mass matrix in (4.226) is not diagonal.
When $m_D \ll M$, the mass eigenvalues

$$m_\nu \simeq -\frac{m_D^2}{M}, \qquad m_N \simeq M \qquad (4.227)$$

correspond to the eigenstates

$$\nu \simeq \nu_L + \nu_L^c, \qquad N \simeq \nu_R + \nu_R^c, \qquad (4.228)$$

which describe the Majorana fermions ($\nu = \nu^c$, $N = N^c$): light and heavy neutrinos
respectively. Choosing for $m_D$ the largest known fermion mass of order the top
quark mass, $m_D \sim m_t \sim 170$ GeV, and taking $M \sim 3 \times 10^{15}$ GeV, we find from
(4.227) that $|m_\nu| \sim 10^{-2}$ eV, which is favored by neutrino oscillation measurements.
If $m_D \sim m_e \sim 0.5$ MeV, then the mass of the heavy neutrino should be $\sim 2 \times 10^{4}$
GeV. It is important to note that the Majorana mass terms are not generated via
the Higgs mechanism, and therefore can be much larger than the masses of ordinary
quarks and leptons. This leads to light neutrino masses that are very small,
according to (4.227). Such a method of obtaining very small masses is known as
the seesaw mechanism. If one were restricted to having only Dirac mass terms, the
Yukawa couplings would have to be unnaturally small.
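
The seesaw estimate is simple to verify by diagonalizing the $2 \times 2$ mass matrix of (4.226) explicitly. The sketch below is illustrative only (function names are ours); the light eigenvalue is written in a form that avoids the loss of precision which a naive difference of two nearly equal numbers would produce for $m_D \ll M$.

```python
import math

def seesaw_masses(m_dirac, m_majorana):
    """Eigenvalues of [[0, m_D], [m_D, M]], written to avoid cancellation for m_D << M."""
    root = math.sqrt(m_majorana**2 + 4.0 * m_dirac**2)
    heavy = 0.5 * (m_majorana + root)
    light = -2.0 * m_dirac**2 / (m_majorana + root)
    return light, heavy

def majorana_scale(m_dirac, m_light):
    """Heavy mass M needed for a given light mass, from |m_nu| ~ m_D^2 / M."""
    return m_dirac**2 / m_light

light, heavy = seesaw_masses(170.0, 3e15)        # masses in GeV, values from the text
light_ev = abs(light) * 1e9                      # convert GeV to eV
m_heavy_electron = majorana_scale(0.5e-3, 1e-11) # m_D ~ 0.5 MeV, |m_nu| ~ 1e-2 eV, in GeV
print(light_ev, heavy, m_heavy_electron)
```

With $m_D \sim 170$ GeV and $M \sim 3 \times 10^{15}$ GeV the light eigenvalue is about $10^{-2}$ eV in magnitude, and the product of the two eigenvalues equals $-m_D^2$, the determinant of the mass matrix.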

The Majorana mass terms violate lepton number by two units. The heavy Majorana
neutrinos $N = N^c$ are coupled to the Higgs particles via (4.224) and they
can decay into a lepton-Higgs pair, $N \to l\varphi$, or into the CP conjugated state,
$N \to \bar l \bar\varphi$, thus violating lepton number (Figure 4.22). Returning to the case of
three generations, we see that the neutrino mass eigenstates do not necessarily
coincide with the flavor (weak) eigenstates. These states are related by the
corresponding Kobayashi-Maskawa mixing matrix. This naturally explains neutrino
oscillations and generically leads to complex Yukawa couplings which are a source
of CP violation. As a result the decay rates

$$\Gamma(N \to l\varphi) = \tfrac{1}{2}(1 + \varepsilon)\, \Gamma_{tot}, \qquad \Gamma(N \to \bar l \bar\varphi) = \tfrac{1}{2}(1 - \varepsilon)\, \Gamma_{tot} \qquad (4.229)$$

are different by the parameter $\varepsilon \ll 1$, which measures the amount of CP violation.
The final products have different lepton numbers and the mean net lepton
number from the decay of the $N$ neutrino is equal to $\varepsilon$. Heavy neutrinos can be
produced after inflation, either in the preheating phase or after thermalization.
Subsequently, their concentration freezes out and the lepton asymmetry is produced in
out-of-equilibrium decays of heavy neutrinos. In the topological transitions which
follow, this asymmetry is partially transferred to baryon asymmetry in an amount
given by (4.223). Detailed calculations show that for the range of parameters
suggested by the measured neutrino oscillations, one can naturally obtain the observed
baryon asymmetry via leptogenesis. At present this theory is considered the leading
baryogenesis scenario.

Fig. 4.22.


Other scenarios In addition to Grand Unified Theory baryogenesis and leptogene- 
sis there exist other mechanisms for explaining baryon asymmetry. Supersymmetry 
in particular opens a number of options. Since supersymmetry extends the particle 
content of the theory near the electroweak scale, the possibility of a strong elec- 
troweak phase transition cannot yet be completely excluded. This revives the hope 
of explaining baryon asymmetry entirely within the MSSM. 

Another interesting consequence of supersymmetry is found in the Affleck-Dine
scenario. This scenario is based on the observation that in supersymmetric theories
ordinary quarks and leptons are accompanied by supersymmetric partners (squarks
and sleptons), which are scalar particles. The corresponding scalar fields carry
baryon and lepton number, which can in principle be very large in the case of a
scalar condensate (classical scalar field). An important feature of supersymmetric
theories is the existence of “flat directions” in the superpotential, along which the
relevant components of the complex scalar fields $\varphi$ can be considered as massless.
Inflation displaces a massless field from its zero position (see Chapters 5 and 8)
and establishes the initial conditions for subsequent evolution of the field. The
condensate is frozen until supersymmetry breaking takes place. Supersymmetry
breaking lifts the flat directions and the scalar field acquires mass. When the Hubble
constant becomes of order this mass, the scalar field starts to oscillate and decays.
At this time B, L and CP violating terms, for example quartic couplings

$$\lambda_1 \varphi^3 \varphi^* + \text{c.c.} \quad \text{and} \quad \lambda_2 \varphi^4 + \text{c.c.}$$

with complex $\lambda_1$, $\lambda_2$, become important, and a substantial baryon asymmetry can
be produced. The scalar particles decay into ordinary quarks and leptons, transferring
to them the generated baryon number. The Affleck-Dine mechanism can be
implemented at nearly any energy scale, even below 200 GeV. By suitable choice
of the parameters, one can explain almost any amount of baryon asymmetry. This
makes the Affleck-Dine scenario practically unfalsifiable, which is a very unattractive
feature of this scenario.

More exotic possibilities have also been considered. Among them are baryo- 
genesis via black hole evaporation and leptogenesis with very weakly coupled 
right-handed Dirac neutrinos. Although at present the accepted wisdom favors lep- 
togenesis, it is not clear which scenario was actually realized in nature. Therefore the 
main lesson of this section is that there exist many ways to “solve” the baryogenesis 
problem. 


4.6.3 Topological defects 


Topological defects do not occur in the Standard Model. However, they are a rather 
generic prediction of theories beyond the Standard Model. Below we briefly discuss 
why unified theories lead to topological defects and what kind of defects can be 
produced in the early universe. 

The Higgs mechanism has become an integral part of modern particle physics. 
The main feature of this mechanism is the existence of scalar fields used to break 
the original symmetry of the theory. Depending on the model, their Lagrangian can 
be written as 


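$$\mathcal{L}_\varphi = \frac{1}{2} \left(\partial_\alpha \varphi\right)\left(\partial^\alpha \varphi\right) - \frac{\lambda}{4} \left(\varphi^2 - \sigma^2\right)^2, \qquad (4.230)$$

where $\varphi \equiv (\varphi^1, \varphi^2, \ldots, \varphi^n)$ is an $n$-plet of real scalar fields. Complex scalar fields
can also be rewritten in the form (4.230) if we use their real and imaginary parts.
For example, in U(1) gauge theory, $\varphi = \varphi^1 + i\varphi^2$ and $n = 2$. The doublet of the
complex fields of electroweak theory corresponds to $n = 4$.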

At very high temperatures symmetry is restored, that is, $\varphi = 0$. As the universe
cools, phase transitions take place. As a result the scalar fields acquire vacuum
expectation values corresponding to the minimum of the potential in (4.230),

$$(\varphi^1)^2 + (\varphi^2)^2 + \cdots + (\varphi^n)^2 = \sigma^2.$$

This vacuum manifold $M$ has a nontrivial structure. For example, for $n = 1$, both
$\varphi = \sigma$ and $\varphi = -\sigma$ are states of minimal energy and so the vacuum manifold has
the topology of a zero-dimensional sphere, $S^0 = \{-1, +1\}$. In U(1) theory, the
vacuum is isomorphic to a circle $S^1$ (the “bottom of the bottle”). In the case $n = 3$,
the vacuum states form a two-dimensional sphere $S^2$.

Topological defects are solitonic solutions of the classical equations for the 
scalar (and gauge) fields. They can be formed during a phase transition and since 
they interpolate between vacuum states they reflect the structure of the vacuum 
manifold. One real scalar field with two degenerate vacua ($n = 1$ and $M = S^0$)
leads to domain walls. In the case of a complex scalar field ($n = 2$, $M = S^1$)
cosmic strings can be formed. If the symmetry is broken with a triplet of real scalar
fields ($n = 3$, $M = S^2$), the topological defects are monopoles. Finally, in the
case $n = 4$ (four scalar fields or, equivalently, a doublet of complex scalar fields)
the vacuum manifold is a 3-sphere $S^3$ and the corresponding defects are textures.
Depending on whether we consider a theory with or without local gauge invariance, 
the topological defects are called local or global respectively. 


Domain walls Let us first consider one real scalar field which has the double-well
potential in (4.230). The states $\varphi = \sigma$ and $\varphi = -\sigma$ correspond to two degenerate
minima of the potential and during symmetry breaking the field $\varphi$ acquires the
values $\sigma$ and $-\sigma$ with equal probability. The important thing is that the phase
transition sets the maximum distance over which the scalar field is correlated. It is
obvious that in the early universe the correlation length cannot exceed the size of
the causally connected region. Let us consider two causally disconnected regions A
and B, and assume that the field $\varphi$ in region A went to the minimum at $\sigma$. The field
in region B does not “know” what happened in region A and, with probability 1/2,
goes to the minimum at $-\sigma$. Since the scalar field changes continuously from $-\sigma$
to $\sigma$, it must vanish on some two-dimensional surface separating regions A and B.
This surface, determined by the equation $\varphi(x^1, x^2, x^3) = 0$, is called the domain
wall (see Figure 4.23). The domain wall (Figure 4.24(a)) has a finite thickness $l$,
which can be estimated with the following simple arguments. Let us assume, for
simplicity, that the domain wall is static and not curved. The energy density of the
scalar field is

$$\varepsilon = \tfrac{1}{2} (\partial_i \varphi)^2 + V; \qquad (4.231)$$

it is distributed as shown in Figure 4.24(b). The total energy per unit surface area
can be estimated as

$$E_w \sim \varepsilon\, l \sim \left(\frac{\sigma}{l}\right)^2 l + \lambda \sigma^4 l, \qquad (4.232)$$

where the first contribution comes from the gradient term. This energy is minimized
for $l \sim \lambda^{-1/2} \sigma^{-1}$ and is equal to $E_w \sim \lambda^{1/2} \sigma^3$.

Fig. 4.23. Panels: domain wall, string, monopole.


Problem 4.33 Consider an infinite wall in the $x$-$y$ plane and verify that
$\varphi(z) = \sigma \tanh(z/l)$, where $l = (\lambda/2)^{-1/2} \sigma^{-1}$, is the solution of the scalar field equation
with the potential in (4.230).
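
Before doing the calculation analytically, one can check the claimed solution numerically. The sketch below assumes the static field equation $\varphi'' = dV/d\varphi$ with $V = (\lambda/4)(\varphi^2 - \sigma^2)^2$ and evaluates its residual on the tanh profile by central differences; the parameter values and function names are arbitrary illustrative choices.

```python
import math

def wall_profile(z, lam, sigma):
    """Candidate wall solution phi(z) = sigma * tanh(z / l), l = (lam/2)^(-1/2) / sigma."""
    thickness = (lam / 2.0) ** -0.5 / sigma
    return sigma * math.tanh(z / thickness)

def residual(z, lam, sigma, h=1e-3):
    """phi'' - dV/dphi for V = (lam/4) (phi^2 - sigma^2)^2, via central differences."""
    phi = wall_profile(z, lam, sigma)
    second = (wall_profile(z + h, lam, sigma) - 2.0 * phi
              + wall_profile(z - h, lam, sigma)) / h**2
    return second - lam * phi * (phi**2 - sigma**2)

# Scan the wall from z = -5 to z = 5 for sample values lam = 0.5, sigma = 1.
worst = max(abs(residual(0.1 * k, lam=0.5, sigma=1.0)) for k in range(-50, 51))
print(worst)
```

The residual stays at the level of the finite-difference error over the whole wall, which is what one expects if the profile solves the field equation. A different thickness, for example $l = \lambda^{-1/2}\sigma^{-1}$, leaves a residual of order $\lambda\sigma^3$.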


Domain walls are nonperturbative solutions of the field equations and they are 
stable with respect to small perturbations. To remove the wall described by the 
solution in Problem 4.33, one has to “lift” the scalar field over the potential barrier 
from $\varphi = \sigma$ to $\varphi = -\sigma$ in infinite space. This costs an infinite amount of energy.

Fig. 4.24. Panels (a) and (b).

It is clear from the previous discussion that on average at least one domain
wall per horizon volume is formed during the cosmological phase transition. The
subsequent evolution of the domain wall network is rather complicated and has
been investigated numerically. The result is that one expects at least one domain
wall per present horizon scale $\sim t_0$. Its mass can be estimated as

$$M_{wall} \sim E_w t_0^2 \sim 10^{65} \lambda^{1/2} (\sigma/100\ \text{GeV})^3\ \text{g}.$$

For realistic values of $\lambda$ and $\sigma$, the mass of the domain wall exceeds the mass of
matter within the present horizon by many orders of magnitude. Such a wall would
lead to unacceptably large CMB fluctuations. Therefore, domain walls are
cosmologically admissible only if the coupling constant $\lambda$ and the symmetry breaking
scale $\sigma$ are unjustifiably small.
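
The order of magnitude of the wall mass can be checked with a few unit conversions. In the sketch below the present horizon scale, the conversion constants and the rough present matter density are our assumed inputs, none of which are quoted in the text.

```python
import math

GEV_IN_GRAMS = 1.783e-24       # 1 GeV/c^2 expressed in grams
GEV_INV_IN_CM = 1.973e-14      # hbar*c / (1 GeV) expressed in cm
HORIZON_CM = 1.3e28            # assumed present horizon scale c*t_0 (about 13.7 Gyr)
MATTER_DENSITY = 1e-29         # assumed rough present density in g/cm^3

def wall_mass_grams(lam, sigma_gev):
    """M_wall ~ E_w * t_0^2 with surface energy E_w ~ lam^{1/2} * sigma^3."""
    surface_energy = math.sqrt(lam) * sigma_gev**3        # in GeV^3
    t0 = HORIZON_CM / GEV_INV_IN_CM                       # horizon scale in GeV^-1
    return surface_energy * t0**2 * GEV_IN_GRAMS

def horizon_matter_grams():
    """Mass of matter inside a sphere of radius c*t_0 at the assumed density."""
    return MATTER_DENSITY * 4.0 / 3.0 * math.pi * HORIZON_CM**3

m_wall = wall_mass_grams(1.0, 100.0)
print(m_wall, m_wall / horizon_matter_grams())
```

For $\lambda \sim 1$ and $\sigma \sim 100$ GeV this gives a wall mass between $10^{65}$ and $10^{66}$ g, which exceeds the mass of matter inside the horizon by roughly ten orders of magnitude under the stated assumptions.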

Homotopy groups give us a useful unifying description of topological defects.
Maps of the $n$-dimensional sphere $S^n$ into a vacuum manifold $M$ are classified
by the homotopy group $\pi_n(M)$. This group counts the number of topologically
inequivalent maps from $S^n$ into $M$ that cannot be continuously deformed into
each other. In the language of homotopy groups, domain walls correspond to the
group $\pi_0(M = S^0)$, which describes the maps of a zero-dimensional sphere
$S^0 = \{-1, +1\}$ to itself. This group is nontrivial and is isomorphic to the group of integers
under addition modulo 2, that is, $\pi_0(S^0) = Z_2$.


Cosmic strings If the symmetry breaking occurs via a U(1) complex scalar field
$\varphi = \varphi^1 + i\varphi^2$ or, equivalently, with two real scalar fields $\varphi^1$, $\varphi^2$, cosmic strings are
formed. In this case the vacuum manifold described by

$$(\varphi^1)^2 + (\varphi^2)^2 = \sigma^2 \qquad (4.233)$$

is obviously a circle $S^1$.

Let us again consider two causally disconnected regions A and B and assume that 
inside region A the scalar fields acquired the expectation values $\phi^1_A > 0$ and $\phi^2_A > 0$
satisfying (4.233). Because of the absence of communication, the expectation values
of the fields in region B are not correlated with those in region A and both can take
negative values: $\phi^1_B < 0$ and $\phi^2_B < 0$ (the only restriction is that they have to satisfy
(4.233)). The probability of this happening is 1/4. The fields are continuous and, in
changing from negative to positive values, must vanish somewhere between regions
A and B. Namely, field $\phi^1$ vanishes on the two-dimensional surface determined
by the equation $\phi^1(x^1, x^2, x^3) = 0$, while field $\phi^2$ is equal to zero on the surface
described by $\phi^2(x^1, x^2, x^3) = 0$. These two surfaces generically cross each other
along a curve which is either infinite or closed. On this curve, $\phi^1 = \phi^2 = 0$, and
we have a false vacuum. Thus, as a result of symmetry breaking, one-dimensional 
topological objects — cosmic strings — are formed (see Figure 4.23). It is clear that 
they are produced with an abundance at least of order one per horizon volume. 

As with domain walls, strings have a finite thickness. The field $\phi$ smoothly
changes from zero in the core of the string and approaches the true vacuum, $|\phi| = \sigma$,
far away from it. Strings are topologically stable, classical solutions of the field
equations.

In the language of homotopy groups, strings correspond to the mappings of a
circle $S^1$ to the vacuum manifold $M = S^1$. The corresponding group is nontrivial:
$\pi_1(S^1) = Z$, where $Z$ is the group of integers under addition. Taking a circle $\gamma(\tau)$
in $x$ space, let us consider its map to $M$: $\phi(\tau) = \sigma\exp(i\theta(\tau))$. Because the complex
field $\phi$ is an unambiguous function of the spatial coordinates $x$, the phase $\theta$ changes
by $2\pi m$ around the circle $\gamma$, where $m$ is an integer. If $m = 0$, the mapping is trivial
and the contour $\gamma$ can be continuously deformed to a point without passing through
the region of false vacuum. Hence it does not contain any topological defects. The
map with $m \neq 0$ wraps the circle $\gamma$ around the vacuum manifold $m$ times. For $m = 1$
there is a string inside the contour $\gamma$. In fact, considering two points with $\theta = 0$
and $\theta = \pi$, where $\phi$ is equal to $\sigma$ and $-\sigma$ respectively, and shrinking the contour,
one necessarily arrives at a place where the field $\phi$ vanishes because otherwise it
would have infinite derivative. This is where the string lives.
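
The integer $m$ can be read off numerically from field values sampled along a closed contour by summing the phase increments between neighbouring points. The sketch below is only an illustration: the helper names `winding_number` and `sample_contour`, and the choice of a test field proportional to $\exp(im\theta)$ on a circle, are assumptions made here and are not taken from the text.

```python
import cmath
import math


def winding_number(field_values):
    """Number of times the phase of a complex field wraps around the
    vacuum circle along a closed, discretized contour."""
    total = 0.0
    n = len(field_values)
    for k in range(n):
        a = field_values[k]
        b = field_values[(k + 1) % n]
        # Phase step between neighbours, wrapped into (-pi, pi].
        total += cmath.phase(b / a)
    return round(total / (2 * math.pi))


def sample_contour(m, sigma=1.0, points=64):
    """Test field sigma * exp(i m theta) sampled on a circle."""
    return [sigma * cmath.exp(1j * m * 2 * math.pi * k / points)
            for k in range(points)]


for m in (0, 1, -2):
    print(m, winding_number(sample_contour(m)))
```

The sampling must be fine enough that the phase changes by less than $\pi$ between neighbouring points; otherwise the wrapped increments no longer add up to the true winding.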

Let us take for simplicity a straight global string (no gauge fields are present) 
and consider its energy per unit length. At a large distance r from the core of the 
string, the derivative $\partial_i\phi$ can be estimated on dimensional grounds as $\sigma/r$. Hence
the gradient term gives a logarithmically divergent contribution to the energy per
unit length:

$$\mu_s \propto \int \frac{1}{r^2}\, d^2x. \tag{4.234}$$

In this case the natural regularization factor is the distance to the nearest string.
As we have seen, axions assume a global $U_{PQ}(1)$ symmetry and therefore global
axionic strings can exist.

In a theory with local gauge invariance the derivative $\partial_i$ is replaced by $D_i\phi =
\partial_i\phi + ieA_i\phi$; the properties of the local strings are very different from those of
global strings. Solving the coupled system of equations for $\phi$ and $A_i$, one can show
that the gauge field compensates the leading term in $\partial_i\phi$ and that the covariant
derivative $D_i\phi$ decays faster than $1/r$ as $r \to \infty$. As a result the energy per unit
length converges. Writing $\phi = \chi\exp(i\theta)$, and assuming that $\partial_i\chi$ decays faster than
$1/r$, we find that the compensation takes place only if

$$A_i \to -\frac{1}{e}\,\partial_i\theta$$

as $r \to \infty$. Taking a contour $\gamma$, far away from the core of the string, and calculating
the flux of the magnetic field ($\mathbf{B} = \nabla \times \mathbf{A}$) we immediately find

$$\int B\, d^2\sigma = \oint A_i\, dx^i = -\frac{2\pi}{e}\, m. \tag{4.235}$$

Here we take into account that $\theta$ changes around the contour by the integer $m$
multiplied by $2\pi$. Thus, in the gauged U(1) theory, strings carry a magnetic flux
inversely proportional to the electric charge $e$, and this flux is quantized.

After symmetry breaking the vector and scalar fields acquire masses $m_A = e\sigma$
and $m_\chi = \sqrt{2\lambda}\,\sigma$ respectively. These masses determine the “thickness” of the string.
Outside the string core both fields tend to their true vacuum values exponentially
quickly. This is not surprising because in the broken U(1) gauge theory no massless
fields remain after symmetry breaking (compare this to the global string, where one
massless scalar field “survives”). The thickness of the string core is determined by
the Compton wavelengths $\delta_\chi \sim m_\chi^{-1}$ and $\delta_A \sim m_A^{-1}$. For $m_\chi > m_A$, the size of the
magnetic core ($\sim \delta_A$) exceeds the size of the false vacuum tube ($\sim \delta_\chi$). In this case,
the scalar and magnetic fields give about the same contribution to the energy per
unit length:

$$\mu_s(\chi) \sim \lambda\sigma^4\delta_\chi^2 \sim \sigma^2 \quad\text{and}\quad \mu_s(A) \sim B^2\delta_A^2 \sim \left(\frac{1}{e\,\delta_A^2}\right)^2 \delta_A^2 \sim \sigma^2.$$

The total mass of a string with length of order the present horizon is about

$$\mu_s t_0 \sim 10^{48}\,(\sigma/10^{15}\ \mathrm{GeV})^2\ \mathrm{g}.$$


Hence, even if symmetry breaking occurred at Grand Unified Theory scales, any 
strings produced would not be in immediate conflict with observations. Moreover, 
such Grand Unified Theory strings could, in principle, serve as the seeds for galaxy 
formation. CMB measurements, however, rule out this possibility; cosmic strings 
cannot play a dominant role in structure formation. Nevertheless this does not mean 
that they were not produced in the early universe. If they were detected, cosmic 
strings would reveal important features of the theory beyond the Standard Model. 
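
The horizon-length string mass can be checked in the same spirit as the other dimensional estimates, taking the energy per unit length to be of order $\sigma^2$. In this sketch the conversion constants, the string length of 10^28 cm and the function name `string_mass_grams` are illustrative assumptions, and numerical factors of order unity are dropped.

```python
import math

GEV_IN_GRAMS = 1.783e-24     # mass equivalent of 1 GeV, in grams
GEV_INV_IN_CM = 1.973e-14    # length equivalent of 1 GeV^-1, in cm


def string_mass_grams(sigma_gev, length_cm=1e28):
    """Order-of-magnitude mass of a string of the given length."""
    mu_gev2 = sigma_gev ** 2                        # energy per unit length
    mu = mu_gev2 * GEV_IN_GRAMS / GEV_INV_IN_CM     # grams per cm
    return mu * length_cm


for sigma in (1e15, 1e16):
    mass = string_mass_grams(sigma)
    print(f"sigma = {sigma:.0e} GeV: mass ~ 10^{math.log10(mass):.0f} g")
```

Because the mass scales as $\sigma^2$ rather than $\sigma^3 t_0^2$, even a Grand Unified Theory scale string stays far below the roughly 10^55 g of matter inside the horizon (again assuming a mean density near 10^-29 g/cm^3).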


Monopoles Monopoles arise if the vacuum manifold has the topology of a two- 
dimensional sphere; for example, in theories where the symmetry is broken with 
three real scalar fields. In this case the vacuum manifold described by 


$$(\phi^1)^2 + (\phi^2)^2 + (\phi^3)^2 = \sigma^2 \tag{4.236}$$

is obviously $S^2$. Again considering two causally disconnected regions A and B,
we find that with probability of order unity one can obtain after symmetry breaking
$\phi^i_A > 0$ and $\phi^i_B < 0$, where $i = 1, 2, 3$ and the fields $\phi^i$ satisfy (4.236). Three
two-dimensional hypersurfaces, determined by the equations $\phi^i(x^1, x^2, x^3) = 0$,
generically cross each other at a point. This is the point of false vacuum because
there all three fields $\phi^i$ vanish. Thus a zero-dimensional topological defect, a
monopole, is formed (Figure 4.23). Solving the equations for the scalar field,
one can find the classical spherically symmetric scalar field configuration which
corresponds to the false vacuum in the center and approaches the true vacuum
as $r \to \infty$. Without going into the detailed structure of the exact solutions, we
can analyze the properties of monopoles using dimensional arguments. In the theory
without gauge fields, two massless bosons survive after symmetry breaking.
Therefore the fields $\phi^i$, smoothly changing from zero in the core of the monopole,
approach their true vacuum configuration only as some power of distance $r$. On
dimensional grounds, $\partial\phi \sim \sigma/r$ (here and in the following formulae we skip all
indices) and the mass of the global monopole

$$M \sim \sigma^2 \int \frac{1}{r^2}\, d^3x \tag{4.237}$$


diverges linearly. The cut-off scale should be taken to be of order the correlation 
length; this can never exceed the horizon scale. 

Local monopoles have quite different properties. As an example, let us consider 
$SO(3) \simeq SU(2)$ local gauge theory with the triplet of real scalar fields. After
symmetry breaking, two of the three gauge fields acquire mass $m_W = e\sigma$. One gauge
field, $A$, remains massless and U(1) gauge invariance survives. The massive vector
fields $W$ decay exponentially quickly beyond their interaction distance, determined
by the Compton wavelength $\delta_W \sim m_W^{-1}$. In the monopole solution the massless
U(1) gauge field $A$ compensates the gradient term in the covariant derivative and
$D\phi = \partial\phi + eA\phi$ decays exponentially as $r \to \infty$. Therefore at large distances $r$
we have

$$A \sim \frac{1}{e}\frac{\partial\phi}{\phi} \sim \frac{1}{er} \tag{4.238}$$

and the corresponding magnetic field is of order

$$B \sim \frac{1}{er^2}. \tag{4.239}$$

Thus the local monopole has a magnetic charge $g \sim 1/e$. The exact calculation
gives $g = 2\pi/e$ in agreement with the result for the Dirac monopole. We have to
stress, however, that in contrast with the Dirac monopole, which is a fundamental
point-like magnetic charge, monopoles in gauge theories are extended objects. They
are classical solutions of the field equations. As with the local string, the monopole
has two cores, scalar and magnetic, with radii

$$\delta_s \sim m_s^{-1} = (2\lambda)^{-1/2}\sigma^{-1} \quad\text{and}\quad \delta_W \sim m_W^{-1} = e^{-1}\sigma^{-1}$$

respectively. Let us assume that the scalar core is smaller than the magnetic one,
that is, $\delta_W > \delta_s$. It is easy to verify that in this case the dominant contribution to
the mass comes from the gauge fields:

$$M \sim B^2\delta_W^3 \sim \left(\frac{g}{\delta_W^2}\right)^2 \delta_W^3 \sim \frac{m_W}{e^2}. \tag{4.240}$$

One can show that this estimate, obtained for $\lambda > e^2$, still applies in the more
complicated case $\lambda < e^2$.

The existence of magnetic monopoles is an inevitable consequence of Grand Uni- 
fied Theories with (semi)simple gauge groups G. This general result follows imme- 
diately from the topological interpretation of monopole solutions. Monopoles exist 
if the vacuum manifold $M$ contains noncontractible 2-surfaces, or equivalently, the
group $\pi_2(M)$ is nontrivial. In the case above, $M = S^2$ and $\pi_2(S^2) = Z$. In Grand
Unified Theories, a semisimple group $G$ is broken to $H = SU(3)_{QCD} \times U(1)_{em}$;
the vacuum manifold is the quotient group $G$ modulo $H$, that is, $M = G/H$. Using
a well known result from the theory of homotopy groups,

$$\pi_2(G/H) = \pi_1(H) = \pi_1(SU(3)_{QCD} \times U(1)_{em}) = Z, \tag{4.241}$$

we conclude that monopoles are unavoidable if the unification group incorporates
electromagnetism. Furthermore, because the $SU(3)_{QCD}$ gauge fields are confined,
the monopoles carry only the magnetic charge of the $U(1)_{em}$ group.


Problem 4.34 Why are the arguments presented above not applicable to elec- 
troweak symmetry breaking? 


Let us estimate the abundance of Grand Unified Theory monopoles produced at
$T_{GUT} \sim 10^{15}$ GeV. As we have already said, at least one defect per horizon volume
is created during symmetry breaking. Taking into account that the horizon scale at
this time is about $t_H \sim 1/T_{GUT}^2$ (in Planck units), we immediately obtain the
following estimate for the average number of monopoles per photon:

$$\frac{n_M}{n_\gamma} \sim \frac{1}{T_{GUT}^3 t_H^3} \sim T_{GUT}^3. \tag{4.242}$$

This ratio does not change significantly during the expansion of the universe and
so the present energy density of the GUT monopoles should be

$$\varepsilon_M^0 \sim 10^{-16} \left(\frac{m_M}{10^{16}\ \mathrm{GeV}}\right) \left(\frac{T_{GUT}}{10^{15}\ \mathrm{GeV}}\right)^3\ \mathrm{g\,cm^{-3}}. \tag{4.243}$$

But this is at least $10^{13}$ times the critical density, obviously a cosmological
disaster. Either one has to abandon Grand Unified Theories or find a solution to
this monopole problem. Inflationary cosmology provides us with such a solution.
If the monopoles were produced in the very early universe, then a subsequent 
inflationary stage would drastically dilute their number density, leaving less than 
one monopole per present horizon scale. Of course, this solution works only if the 
reheating temperature after inflation does not exceed the Grand Unified Theory 
scale; otherwise monopoles are produced in unacceptable amounts after the end of 
inflation. We will see in the following sections that this assumption about the energy 
scale of inflation is in agreement with contemporary ideas. Moreover, according 
to inflationary scenarios, it is likely that the temperature in the universe was never 
larger than the Grand Unified Theory scale and hence that monopoles were never 
produced according to the mechanism described above. This, however, does not 
mean that primordial Grand Unified Theory monopoles do not exist. In principle 
they could be produced during a preheating phase after inflation (see Section 5.5) 
in amounts allowed by present cosmological bounds. The search for primordial 
monopoles remains important. 


Textures The other possible defect, a texture, arises when the symmetry is
broken with four real scalar fields $\phi^i$, $i = 1, \ldots, 4$. In this case the vacuum manifold
is a 3-sphere $S^3$ and the textures are classified by the homotopy group $\pi_3(M)$.
Because four equations

$$\phi^i(x^1, x^2, x^3) = 0$$

for three variables $x^1, x^2, x^3$ generically have no solutions, regions of false vacuum
are not formed during the phase transition. However, the fields $\phi^i$ are uncorrelated
on superhorizon scales and therefore $(\partial\phi)^2$ is generally different from zero
even if $\phi^2 = \sigma^2$ and $V(\phi) = 0$. The resulting stable structure has positive energy
and is called a global texture. Some time ago global textures were considered a 
compelling mechanism for explaining the structure of the universe. However, the 
texture scenario is in contradiction with measurements of CMB fluctuations and 
hence textures cannot play any significant role in structure formation. 

Static textures do not “survive” in local gauge theories. In this case the gauge 
fields exactly compensate the spatial gradients of the scalar fields; as distinct from 
strings and monopoles, textures correspond to the true vacuum everywhere. As a 
result $(D\phi)^2$ vanishes and the total energy of a local texture is equal to zero.

In the Standard Model, electroweak symmetry is broken with a doublet of com- 
plex scalar fields, or equivalently, with four real scalar fields. Hence, the only 
topological defects which could occur are textures. However, because this theory 
possesses local gauge invariance, the corresponding static textures have zero energy 
and they are not very interesting. 

In concluding this section, we would like to warn the reader that the above
considerations are simplified. To analyze defects in realistic theories with complicated
symmetry breaking schemes, we have to use more powerful methods which go 
far beyond the scope of this book. In these theories hybrid topological defects, for 
example, strings with monopoles at their ends, or string-bounded domain walls, 
can exist. 


5 


Inflation I: homogeneous limit 


Matter is distributed very homogeneously and isotropically on scales larger than a 
few hundred megaparsecs. The CMB gives us a “photograph” of the early universe, 
which shows that at recombination the universe was extremely homogeneous and 
isotropic (with accuracy $\sim 10^{-4}$) on all scales up to the present horizon. Given that
the universe evolves according to the Hubble law, it is natural to ask which initial 
conditions could lead to such homogeneity and isotropy. 

To obtain an exhaustive answer to this question we have to know the exact 
physical laws which govern the evolution of the very early universe. However, 
as long as we are interested only in the general features of the initial conditions 
it suffices to know a few simple properties of these laws. We will assume that 
inhomogeneity cannot be dissolved by expansion. This natural surmise is supported 
by General Relativity (see Part II of this book for details). We will also assume that 
nonperturbative quantum gravity does not play an essential role at sub-Planckian 
curvatures. On the other hand, we are nearly certain that nonperturbative quantum 
gravity effects become very important when the curvature reaches Planckian values 
and the notion of classical spacetime breaks down. Therefore we address the initial 
conditions at the Planckian time $t_i = t_{Pl} \sim 10^{-43}$ s.

In this chapter we discuss the initial conditions problem we face in a decelerating 
universe and show how this problem can be solved if the universe undergoes a stage 
of the accelerated expansion known as inflation. 


5.1 Problem of initial conditions 


There are two independent sets of initial conditions characterizing matter: (a) its 
spatial distribution, described by the energy density $\varepsilon(\mathbf{x})$ and (b) the initial field of
velocities. Let us determine them given the current state of the universe. 


Homogeneity, isotropy (horizon) problem The present homogeneous, isotropic domain
of the universe is at least as large as the present horizon scale, $ct_0 \sim 10^{28}$ cm.
Initially the size of this domain was smaller by the ratio of the corresponding scale
factors, $a_i/a_0$. Assuming that inhomogeneity cannot be dissolved by expansion, we
may safely conclude that the size of the homogeneous, isotropic region from which
our universe originated at $t = t_i$ was larger than

$$l_i \sim ct_0\,\frac{a_i}{a_0}. \tag{5.1}$$

It is natural to compare this scale to the size of a causal region $l_c \sim ct_i$:

$$\frac{l_i}{l_c} \sim \frac{t_0}{t_i}\,\frac{a_i}{a_0}. \tag{5.2}$$
To obtain a rough estimate of this ratio we note that if the primordial radiation
dominates at $t_i \sim t_{Pl}$, then its temperature is $T_{Pl} \sim 10^{32}$ K. Hence

$$(a_i/a_0) \sim (T_0/T_{Pl}) \sim 10^{-32}$$

and we obtain

$$\frac{l_i}{l_c} \sim \frac{10^{17}}{10^{-43}}\, 10^{-32} \sim 10^{28}. \tag{5.3}$$

Thus, at the initial Planckian time, the size of our universe exceeded the causality
scale by 28 orders of magnitude. This means that in $10^{84}$ causally disconnected
regions the energy density was smoothly distributed with a fractional variation not
exceeding $\delta\varepsilon/\varepsilon \sim 10^{-4}$. Because no signals can propagate faster than light, no
causal physical processes can be responsible for such an unnaturally fine-tuned
matter distribution.

Assuming that the scale factor grows as some power of time, we can use an
estimate $a/t \sim \dot a$ and rewrite (5.2) as

$$\frac{l_i}{l_c} \sim \frac{\dot a_i}{\dot a_0}. \tag{5.4}$$

Thus, the size of our universe was initially larger than that of a causal patch by 
the ratio of the corresponding expansion rates. Assuming that gravity was always 
attractive and hence was decelerating the expansion, we conclude from (5.4) that 
the homogeneity scale was always larger than the scale of causality. Therefore, the 
homogeneity problem is also sometimes called the horizon problem. 
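
The arithmetic behind the horizon problem is short enough to reproduce directly. The sketch below plugs the round numbers used in this section into the ratio of the homogeneity scale to the causal scale; the function name `horizon_problem` and the choice of a present temperature of order 1 K (standing in for 2.7 K at order-of-magnitude accuracy) are assumptions made for illustration.

```python
import math


def horizon_problem(t0_s=1e17, t_planck_s=1e-43,
                    temp_today_k=1.0, temp_planck_k=1e32):
    """Return (l_i / l_c, number of causally disconnected regions)."""
    # For radiation, a ~ 1/T, so a_i / a_0 ~ T_0 / T_Pl.
    scale_factor_ratio = temp_today_k / temp_planck_k
    # l_i / l_c ~ (t_0 / t_i) * (a_i / a_0).
    size_ratio = (t0_s / t_planck_s) * scale_factor_ratio
    # Causal regions fill a volume, hence the cube.
    return size_ratio, size_ratio ** 3


size_ratio, regions = horizon_problem()
print(f"l_i / l_c ~ 10^{math.log10(size_ratio):.0f}")
print(f"regions   ~ 10^{math.log10(regions):.0f}")
```

Changing any input by a factor of a few shifts the exponents only slightly, which is why the conclusion does not depend on the precise numbers chosen.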


Initial velocities (flatness) problem Let us suppose for a minute that someone has 
managed to distribute matter in the required way. The next question concerns initial 
velocities. Only after they are specified is the Cauchy problem completely posed 
and can the equations of motion be used to predict the future of the universe 
unambiguously. The initial velocities must obey the Hubble law because otherwise 
the initial homogeneity is very quickly spoiled. That this has to occur in so many 
causally disconnected regions further complicates the horizon problem. Assuming
that it has, nevertheless, been achieved, we can ask how accurately the initial Hubble 
velocities have to be chosen for a given matter distribution. 

Let us consider a large spherically symmetric cloud of matter and compare its
total energy with the kinetic energy due to Hubble expansion, $E^k$. The total energy
is the sum of the positive kinetic energy and the negative potential energy of the
gravitational self-interaction, $E^p$. It is conserved:

$$E^{tot} = E^k_i + E^p_i = E^k_0 + E^p_0.$$

Because the kinetic energy is proportional to the velocity squared,

$$E^k_i = E^k_0\,(\dot a_i/\dot a_0)^2$$

and we have

$$\frac{E^{tot}_i}{E^k_i} = \frac{E^k_i + E^p_i}{E^k_i} = \frac{E^k_0 + E^p_0}{E^k_0} \left(\frac{\dot a_0}{\dot a_i}\right)^2. \tag{5.5}$$

Since $E^k_0 \sim |E^p_0|$ and $\dot a_0/\dot a_i \le 10^{-28}$, we find

$$\frac{E^{tot}_i}{E^k_i} \le 10^{-56}. \tag{5.6}$$

This means that for a given energy density distribution the initial Hubble velocities
must be adjusted so that the huge negative gravitational energy of the matter is
compensated by a huge positive kinetic energy to an unprecedented accuracy of
$10^{-54}$%. An error in the initial velocities exceeding $10^{-54}$% has a dramatic
consequence: the universe either recollapses or becomes “empty” too early. To stress the
unnaturalness of this requirement one speaks of the initial velocities problem.


Problem 5.1 How can the above consideration be made rigorous using the Birkhoff 
theorem? 


In General Relativity the problem described can be reformulated in terms of the
cosmological parameter $\Omega(t)$ introduced in (1.21). Using the definition of $\Omega(t)$ we
can rewrite Friedmann equation (1.67) as

$$\Omega(t) - 1 = \frac{k}{(Ha)^2} \tag{5.7}$$

and hence

$$\Omega_i - 1 = (\Omega_0 - 1)\,\frac{(Ha)_0^2}{(Ha)_i^2} = (\Omega_0 - 1) \left(\frac{\dot a_0}{\dot a_i}\right)^2 \le 10^{-56}. \tag{5.8}$$

Note that this relation immediately follows from (5.5) if we take into account
that $\Omega = |E^p|/E^k$ (see Problem 1.4). We infer from (5.8) that the cosmological
parameter must initially be extremely close to unity, corresponding to a flat universe.
Therefore the problem of initial velocities is also called the flatness problem.
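
The bound on the initial deviation from flatness follows from the same round numbers as the horizon estimate. The sketch below assumes a power-law expansion, so that the expansion rate is of order the scale factor divided by time, and a present deviation of order unity; the function name `initial_flatness_bound` and the default inputs are illustrative assumptions.

```python
import math


def initial_flatness_bound(scale_factor_growth=1e32,
                           t_initial_s=1e-43, t_today_s=1e17):
    """Upper bound on |Omega_i - 1| when |Omega_0 - 1| is of order unity."""
    # adot ~ a / t, so adot_0 / adot_i ~ (a_0 / a_i) * (t_i / t_0).
    adot_ratio = scale_factor_growth * (t_initial_s / t_today_s)
    # Omega - 1 scales as 1 / adot^2.
    return adot_ratio ** 2


bound = initial_flatness_bound()
print(f"|Omega_i - 1| <~ 10^{math.log10(bound):.0f}")
print(f"required accuracy ~ 10^{math.log10(bound * 100):.0f} percent")
```

The second line expresses the same bound as a percentage, which is the form in which the fine-tuning of the initial velocities is quoted.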


Initial perturbation problem One further problem we mention here for complete- 
ness is the origin of the primordial inhomogeneities needed to explain the large-scale 
structure of the universe. They must be initially of order $\delta\varepsilon/\varepsilon \sim 10^{-5}$ on galactic
scales. This further aggravates the very difficult problem of homogeneity and
isotropy, making it completely intractable. We will see later that the problem of 
initial perturbations has the same roots as the horizon and flatness problems and 
that it can also be successfully solved in inflationary cosmology. However, for the 
moment we put it aside and proceed with the “more easy” problems. 

The above considerations clearly show that the initial conditions which led to the 
observed universe are very unnatural and nongeneric. Of course, one can make the 
objection that naturalness is a question of taste and even claim that the most simple 
and symmetric initial conditions are “more physical.” In the absence of a quan- 
titative measure of “naturalness” for a set of initial conditions it is very difficult 
to argue with this attitude. On the other hand it is hard to imagine any measure 
which selects the special and degenerate conditions in preference to the generic 
ones. In the particular case under consideration the generic conditions would mean 
that the initial distribution of the matter is strongly inhomogeneous with $\delta\varepsilon/\varepsilon \gtrsim 1$
everywhere or, at least, in the causally disconnected regions. 

The universe is unique and we do not have the opportunity to repeat the “ex- 
periment of creation”. Therefore cosmological theory can claim to be a successful 
physical theory only if it can explain the state of the observed universe using simple 
physical ideas and starting with the most generic initial conditions. Otherwise it 
would simply amount to “cosmic archaeology,” where “cosmic history” is written on 
the basis of a limited number of hot big bang remnants. If we are pretentious enough 
to answer the question raised by Einstein, “What really interests me is whether God 
had any choice when he created the World,” we must be able to explain how a partic- 
ular universe can be created starting with generic initial conditions. The inflationary 
paradigm seems to be a step in the right direction and it strongly restricts “God’s 
choice.” Moreover, it makes important predictions which can be verified experi- 
mentally (observationally), thus giving cosmology the status of a physical theory. 


5.2 Inflation: main idea 


We have seen so far that the same ratio, $\dot a_i/\dot a_0$, enters both sets of independent
initial conditions. The large value of this ratio determines the number of causally
disconnected regions and defines the necessary accuracy of the initial velocities.
If gravity was always attractive, then $\dot a_i/\dot a_0$ is necessarily larger than unity
because gravity decelerates an expansion. Therefore, the conclusion $\dot a_i/\dot a_0 \gg 1$ can
be avoided only if we assume that during some period of expansion gravity acted
as a “repulsive” force, thus accelerating the expansion. In this case we can have
$\dot a_i/\dot a_0 < 1$ and the creation of our type of universe from a single causally connected
domain may become possible. A period of accelerated expansion is a necessary
condition, but whether it is also sufficient depends on the particular model in which
this condition is realized. With these remarks in mind we arrive at the following 
general definition of inflation: 


Inflation is a stage of accelerated expansion of the universe when gravity acts as a repulsive 
force. 


Figure 5.1 shows how the old picture of a decelerated Friedmann universe is 
modified by inserting a stage of cosmic acceleration. It is obvious that if we do not 
want to spoil the successful predictions of the standard Friedmann model, such as 
nucleosynthesis, inflation should begin and end sufficiently early. We will see later 
that the requirement of the generation of primordial fluctuations further restricts 
the energy scale of inflation; namely, in the simple models inflation should be over 
at $t_f \sim 10^{-34}$–$10^{-36}$ s. Successful inflation must also possess a smooth graceful
exit into the decelerated Friedmann stage because otherwise the homogeneity of 
the universe would be destroyed. 

Fig. 5.1. (Curve labels: inflation; decelerated Friedmann expansion.)

Inflation explains the origin of the big bang; since it accelerates the expansion,
small initial velocities within a causally connected patch become very large.
Furthermore, inflation can produce the whole observable universe from a small
homogeneous domain even if the universe was strongly inhomogeneous outside of
this domain. The reason is that in an accelerating universe there always exists an
event horizon. According to (2.13) it has size

$$r_e(t) = a(t) \int_t^{t_{max}} \frac{dt}{a} = a(t) \int_{a(t)}^{a_{max}} \frac{da}{\dot a\, a}. \tag{5.9}$$


The integral converges even if $a_{max} \to \infty$ because the expansion rate $\dot a$ grows with
$a$. The existence of an event horizon means that anything at time $t$ a distance larger
than $r_e(t)$ from an observer cannot influence that observer’s future. Hence the future
evolution of the region inside a ball of radius $r_e(t)$ is completely independent of the
conditions outside a ball of radius $2r_e(t)$ centered at the same place. Let us assume
that at $t = t_i$ matter was distributed homogeneously and isotropically only inside a
ball of radius $2r_e(t_i)$ (Figure 5.2). Then an inhomogeneity propagating from outside
this ball can spoil the homogeneity only in the region which was initially between
the spheres of radii $r_e(t_i)$ and $2r_e(t_i)$. The region originating from the sphere of
radius $r_e(t_i)$ remains homogeneous. This internal domain can be influenced only by
events which happened at $t_i$ between the two spheres, where the matter was initially
distributed homogeneously and isotropically.
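
The convergence of the event horizon integral is easy to see for a power-law scale factor, used here purely as an illustrative assumption: for $a \propto t^p$ with $p > 1$ the integral gives $r_e(t) = t/(p - 1)$, while for $p < 1$ it grows without bound as the upper limit increases. The sketch below evaluates the integral numerically; the function name `event_horizon`, the midpoint rule in $\ln t$ and the step count are choices made for this example.

```python
import math


def event_horizon(t, p, t_max, steps=20000):
    """a(t) * integral from t to t_max of dt'/a(t') for a(t) = t**p,
    evaluated with the midpoint rule in u = ln t'."""
    u0, u1 = math.log(t), math.log(t_max)
    du = (u1 - u0) / steps
    total = 0.0
    for k in range(steps):
        t_prime = math.exp(u0 + (k + 0.5) * du)
        # dt' = t' du, so dt'/a = t'**(1 - p) du.
        total += t_prime ** (1.0 - p) * du
    return t ** p * total


# Accelerating case (p = 2): approaches t / (p - 1) = 1 as t_max grows.
print(event_horizon(1.0, 2.0, 1e3), event_horizon(1.0, 2.0, 1e6))
# Decelerating case (p = 1/2): keeps growing with t_max.
print(event_horizon(1.0, 0.5, 1e6), event_horizon(1.0, 0.5, 1e8))
```

The contrast between the two cases is the numerical counterpart of the statement that the expansion rate must grow with the scale factor for the integral to converge.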

The physical size of the homogeneous internal region increases and is equal to

$$r_h(t_f) = r_e(t_i)\,\frac{a_f}{a_i} \tag{5.10}$$

at the end of inflation. It is natural to compare this scale with the particle horizon
size, which in an accelerated universe can be estimated as

$$r_p(t) = a(t) \int_{t_i}^{t} \frac{dt}{a} = a(t) \int_{a_i}^{a} \frac{da}{\dot a\, a} \sim \frac{a(t)}{a_i}\, r_e(t_i), \tag{5.11}$$

since the main contribution to the integral comes from $a \sim a_i$. At the end of inflation
$r_p(t_f) \sim r_h(t_f)$, that is, the size of the homogeneous region, originating from a
causal domain, is of order the particle horizon scale.

Thus, instead of considering a homogeneous universe in many causally discon- 
nected regions, we can begin with a small homogeneous causal domain which 
inflation blows up to a very large size, preserving the homogeneity irrespective of 
the conditions outside this domain. 


Problem 5.2 Why does the above consideration fail in a decelerating universe? 


The next question is whether we can relax the restriction of homogeneity on 
the initial conditions. Namely, if we begin with a strongly inhomogeneous causal 
domain, can inflation still produce a large homogeneous universe? 

The answer to this question is positive. Let us assume that the initial energy
density inhomogeneity is of order unity on scales $\sim H_i^{-1}$, that is,

$$\left(\frac{\delta\varepsilon}{\varepsilon}\right)_{t_i} \sim \frac{1}{\varepsilon}\frac{|\nabla\varepsilon|}{a_i}\, H_i^{-1} = \frac{|\nabla\varepsilon|}{\varepsilon}\frac{1}{\dot a_i} \sim O(1), \tag{5.12}$$

where $\nabla$ is the spatial derivative with respect to the comoving coordinates. At
$t > t_i$, the contribution of this inhomogeneity to the variation of the energy density
within the Hubble scale $H(t)^{-1}$ can be estimated as

$$\left(\frac{\delta\varepsilon}{\varepsilon}\right)_{t} \sim \frac{1}{\varepsilon}\frac{|\nabla\varepsilon|}{a(t)}\, H(t)^{-1} \sim O(1)\,\frac{\dot a_i}{\dot a(t)}, \tag{5.13}$$

where we have assumed that $|\nabla\varepsilon|/\varepsilon$ does not change substantially during
expansion. This assumption is supported by the analysis of the behavior of linear
perturbations on scales larger than the curvature scale $H^{-1}$ (see Chapters 7 and
8). It follows from (5.13) that if the universe undergoes a stage of acceleration,
that is, $\dot a(t) > \dot a_i$ for $t > t_i$, then the contribution of a large initial inhomogeneity
to the energy variation on the curvature scale disappears. A patch of size $H^{-1}$
becomes more and more homogeneous because the initial inhomogeneity is “kicked
out”: the physical size of the perturbation, $\propto a$, grows faster than the curvature
scale, $H^{-1} = a/\dot a$, while the perturbation amplitude does not change substantially.
Since inhomogeneities are “devalued” within the curvature scale, the name
“inflation” fairly captures the physical effect of accelerated expansion. The consideration
above is far from rigorous. However, it gives the flavor of the “no-hair” theorem
for an inflationary stage.

To sum up, inflation demolishes large initial inhomogeneities and produces a 
homogeneous, isotropic domain. It follows from (5.13) that if we want to avoid
the situation of a large initial perturbation re-entering the present horizon, $\sim H_0^{-1}$,
and inducing a large inhomogeneity, we have to assume that the initial expansion
rate was much smaller than the rate of expansion today, that is, $\dot a_i/\dot a_0 \ll 1$. More
precisely, the CMB observations require that the variation of the energy density
on the present horizon scale does not exceed $10^{-5}$. The traces of an initial large
inhomogeneity will be sufficiently diluted only if $\dot a_i/\dot a_0 < 10^{-5}$. Rewriting (5.8) as

$$\Omega_0 - 1 = (\Omega_i - 1) \left(\frac{\dot a_i}{\dot a_0}\right)^2 \tag{5.14}$$

we see that if $|\Omega_i - 1| \sim O(1)$ then

$$\Omega_0 = 1 \tag{5.15}$$

to very high accuracy. This important robust prediction of inflation has a kinematical
origin and it states that the total energy density of all components of matter,
irrespective of their origin, must be equal to the critical energy density today.
We will see later that amplified quantum fluctuations lead to tiny corrections to
$\Omega_0 = 1$, which are of order $10^{-5}$. It is worth noting that, in contrast to a decelerating
universe where $\Omega(t) \to 1$ as $t \to 0$, in an accelerating universe $\Omega(t) \to 1$ as
$t \to \infty$, that is, $\Omega = 1$ is its future attractor.
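
The opposite behavior of $\Omega$ in decelerating and accelerating universes can be illustrated with a power-law scale factor $a \propto t^p$, which is an assumption made only for this sketch and is reasonable while the curvature term stays small. Since $\Omega - 1$ scales as the inverse square of the expansion rate, and the expansion rate scales as $t^{p-1}$, the deviation behaves as $t^{2(1-p)}$. The function name `omega_deviation` is hypothetical.

```python
def omega_deviation(t, t_ref, deviation_ref, p):
    """|Omega - 1| at time t for a(t) ~ t**p, given its value at t_ref.

    Uses Omega - 1 = k / adot^2 with adot ~ t**(p - 1)."""
    return deviation_ref * (t / t_ref) ** (2.0 * (1.0 - p))


# Decelerating, radiation-like expansion (p = 1/2): deviation grows with t.
print(omega_deviation(t=100.0, t_ref=1.0, deviation_ref=0.01, p=0.5))
# Accelerating expansion (p = 2): deviation decays as 1 / t^2.
print(omega_deviation(t=100.0, t_ref=1.0, deviation_ref=0.01, p=2.0))
```

In the first case a one percent deviation grows to order unity after the time increases a hundredfold; in the second it shrinks to about one part in a million over the same interval.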


Problem 5.3 Why does the consideration above fail for $\Omega_i = 0$?


5.3 How can gravity become “repulsive”? 


To answer this question we recall the Friedmann equation (1.66):

$$\ddot a = -\frac{4\pi}{3}\, G\,(\varepsilon + 3p)\, a. \tag{5.16}$$

Obviously, if the strong energy dominance condition, $\varepsilon + 3p > 0$, is satisfied, then
$\ddot a < 0$ and gravity decelerates the expansion. The universe can undergo a stage
of accelerated expansion with $\ddot a > 0$ only if this condition is violated, that is, if
$\varepsilon + 3p < 0$. One particular example of “matter” with a broken energy dominance
condition is a positive cosmological constant, for which $p_V = -\varepsilon_V$ and $\varepsilon + 3p =
-2\varepsilon_V < 0$. In this case the solution of Einstein’s equations is a de Sitter universe,
discussed in detail in Sections 1.3.6 and 2.3. For $t \gg H_\Lambda^{-1}$, the de Sitter universe
expands exponentially quickly, $a \propto \exp(H_\Lambda t)$, and the rate of expansion grows as
the scale factor. The exact de Sitter solution fails to satisfy all necessary conditions
for successful inflation: namely, it does not possess a smooth graceful exit into the
Friedmann stage. Therefore, in realistic inflationary models, it can be utilized only
as a zero order approximation. To have a graceful exit from inflation we must allow
the Hubble parameter to vary in time.
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Let us now determine the general conditions which must be satisfied in a suc- 
cessful inflationary model. Because 


ee ee 
“=H +H, (5.17) 
a 


and @ should become negative during a graceful exit, the derivative of the Hubble 
constant, H, must obviously be negative. The ratio |H|/H? grows toward the end of 
inflation and the graceful exit takes place when | H| becomes of order H*. Assuming 
that H? changes faster than H , that is, |H| < 2H H, we obtain the following generic 
estimate for the duration of inflation: 


bp / al (5.18) 


where Hj; and H; refer to the beginning of inflation. At t ~ f ¢ the expression on 
the right hand side in (5.17) changes sign and the universe begins to decelerate. 

Inflation should last long enough to stretch a small domain to the scale of the observable universe. Rewriting the condition $\dot a_i/\dot a_0 < 10^{-5}$ as

$$\frac{\dot a_i}{\dot a_f}\frac{\dot a_f}{\dot a_0} = \frac{a_i H_i}{a_f H_f}\frac{\dot a_f}{\dot a_0} < 10^{-5}$$

and taking into account that $\dot a_f/\dot a_0$ should be larger than $10^{28}$, we conclude that inflation is successful only if

$$\frac{a_f}{a_i} > 10^{33}\,\frac{H_i}{H_f}.$$

Let us assume that $|\dot H_i| \ll H_i^2$ and neglect the change of the Hubble parameter. Then the ratio of the scale factors can be roughly estimated as

$$a_f/a_i \sim \exp(H_i t_f) \sim \exp\left(H_i^2/|\dot H_i|\right) > 10^{33}. \tag{5.19}$$

Hence inflation can solve the initial conditions problem only if $t_f > 75H_i^{-1}$, that is, it lasts longer than 75 Hubble times (e-folds). Rewritten in terms of the initial values of the Hubble parameter and its derivative, this condition takes the form

$$\frac{|\dot H_i|}{H_i^2} < \frac{1}{75}. \tag{5.20}$$




Using the Friedmann equations (1.67) and (1.68) with $k = 0$, we can reformulate it in terms of the bounds on the initial equation of state

$$\frac{(\varepsilon + p)_i}{\varepsilon_i} < 10^{-2}. \tag{5.21}$$

Thus, at the beginning of inflation the deviation from the vacuum equation of state must not exceed 1%. Therefore an exact de Sitter solution is a very good approximation for the initial stage of inflation. Inflation ends when $\varepsilon + p \sim \varepsilon$.
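
The numbers in (5.19)-(5.21) can be checked with a few lines of arithmetic. The sketch below is only an illustration: the function names are made up here, and it assumes the flat Friedmann relations in the form $\dot H = -4\pi(\varepsilon + p)$ and $H^2 = (8\pi/3)\varepsilon$ with $G = 1$, so that $|\dot H|/H^2 = (3/2)(\varepsilon + p)/\varepsilon$.

```python
import math

def required_efolds(growth_factor=1e33):
    """Number of e-folds N such that exp(N) equals the required growth a_f/a_i."""
    return math.log(growth_factor)

def equation_of_state_bound(n_efolds=75.0):
    """Upper bound on (eps + p)/eps implied by |Hdot|/H^2 < 1/n_efolds.

    Assumes Hdot = -4*pi*(eps + p) and H^2 = (8*pi/3)*eps (k = 0, G = 1),
    which gives |Hdot|/H^2 = (3/2)*(eps + p)/eps.
    """
    return 2.0 / (3.0 * n_efolds)

print("e-folds needed for a growth of 1e33:", required_efolds())
print("bound on (eps + p)/eps:", equation_of_state_bound())
```

The first number is close to the 75 e-folds quoted in the text and the second is just below one percent, in line with (5.21).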


Problem 5.4 Consider an exceptional case where $|\dot H|$ decays at the same rate as $H^2$, that is, $\dot H = -pH^2$, where $p = \mathrm{const}$. Show that for $p < 1$ we have power-law
inflation. This inflation has no natural graceful exit and in this sense is similar to a 
pure de Sitter universe. 


5.4 How to realize the equation of state $p \approx -\varepsilon$


Thus far we have used the language of ideal hydrodynamics, which is an adequate 
phenomenological description of matter on large scales. Now we discuss a simple 
field-theoretic model where the required equation of state can be realized. The 
natural candidate to drive inflation is a scalar field. The name given to such a field 
is the “inflaton.” We saw that the energy-momentum tensor for a scalar field can 
be rewritten in a form which mimics an ideal fluid (see (1.58)). The homogeneous 
classical field (scalar condensate) is then characterized by energy density 


$$\varepsilon = \tfrac{1}{2}\dot\varphi^2 + V(\varphi), \tag{5.22}$$

and pressure

$$p = \tfrac{1}{2}\dot\varphi^2 - V(\varphi). \tag{5.23}$$


We have neglected spatial derivatives here because they become negligible soon 
after the beginning of inflation due to the “no-hair” theorem. 


Problem 5.5 Consider a massive scalar field with potential $V = \tfrac{1}{2}m^2\varphi^2$, where $m \ll m_{Pl}$, and determine the bound on the allowed inhomogeneity imposed by
the requirement that the energy density must not exceed the Planckian value. Why 
does the contribution of the spatial gradients to the energy-momentum tensor decay 
more quickly than the contribution of the mass term? 


It follows from (5.22) and (5.23) that the scalar field has the desired equation of state only if $\dot\varphi^2 \ll V(\varphi)$. Because $p = -\varepsilon + \dot\varphi^2$, the deviation of the equation of state from that for the vacuum is entirely characterized by the kinetic energy, $\dot\varphi^2$, which must be much smaller than the potential energy $V(\varphi)$. Successful realization of inflation thus requires keeping $\dot\varphi^2$ small compared to $V(\varphi)$ during a sufficiently long time interval, or more precisely, for at least 75 e-folds. In turn this depends on the shape of the potential $V(\varphi)$. To determine which potentials can provide us with inflation, we have to study the behavior of a homogeneous classical scalar field in an expanding universe. The equation for this field can be derived either directly from the Klein-Gordon equation (1.57) or by substituting (5.22) and (5.23) into the conservation law (1.65). The result is

$$\ddot\varphi + 3H\dot\varphi + V_{,\varphi} = 0, \tag{5.24}$$

where $V_{,\varphi} \equiv \partial V/\partial\varphi$. This equation has to be supplemented by the Friedmann equation:

$$H^2 = \frac{8\pi}{3}\left(\frac{1}{2}\dot\varphi^2 + V(\varphi)\right), \tag{5.25}$$

where we have set $G = 1$ and $k = 0$. We first find the solutions of (5.24) and (5.25) for a free massive scalar field and then study the behavior of the scalar field in the case of a general potential $V(\varphi)$.


5.4.1 Simple example: $V = \tfrac{1}{2}m^2\varphi^2$
Substituting $H$ from (5.25) into (5.24), we obtain the closed form equation for $\varphi$,

$$\ddot\varphi + \sqrt{12\pi}\left(\dot\varphi^2 + m^2\varphi^2\right)^{1/2}\dot\varphi + m^2\varphi = 0. \tag{5.26}$$

This is a nonlinear second order differential equation with no explicit time dependence. Therefore it can be reduced to a first order differential equation for $\dot\varphi(\varphi)$. Taking into account that

$$\ddot\varphi = \dot\varphi\,\frac{d\dot\varphi}{d\varphi},$$

(5.26) becomes

$$\frac{d\dot\varphi}{d\varphi} = -\frac{\sqrt{12\pi}\left(\dot\varphi^2 + m^2\varphi^2\right)^{1/2}\dot\varphi + m^2\varphi}{\dot\varphi}, \tag{5.27}$$

which can be studied using the phase diagram method. The behavior of the solutions in the $\varphi$-$\dot\varphi$ plane is shown in Figure 5.3. The important feature of this diagram is the existence of an attractor solution to which all other solutions converge in time. One can distinguish different regions corresponding to different effective equations of state. Let us consider them in more detail. We restrict ourselves to the lower right quadrant ($\varphi > 0$, $\dot\varphi < 0$); solutions in the other quadrants can easily be derived simply by taking into account the symmetry of the diagram.


Fig. 5.3.


Ultra-hard equation of state First we study the region where $|\dot\varphi| \gg m\varphi$. It describes the situation when the potential energy is small compared to the kinetic energy, so that $\dot\varphi^2 \gg V$. It follows from (5.22) and (5.23) that in this case the equation of state is ultra-hard, $p \approx +\varepsilon$. Neglecting $m\varphi$ compared to $\dot\varphi$ in (5.27), we obtain

$$\frac{d\dot\varphi}{d\varphi} \simeq \sqrt{12\pi}\,\dot\varphi. \tag{5.28}$$

The solution of this equation is

$$\dot\varphi = C\exp\left(\sqrt{12\pi}\,\varphi\right), \tag{5.29}$$

where $C < 0$ is a constant of integration. In turn, solving (5.29) for $\varphi(t)$ gives

$$\varphi = \mathrm{const} - \frac{1}{\sqrt{12\pi}}\ln t. \tag{5.30}$$

Substituting this result into (5.25) and neglecting the potential term, we obtain

$$H^2 \equiv \left(\frac{\dot a}{a}\right)^2 = \frac{1}{9t^2}. \tag{5.31}$$

It immediately follows that $a \propto t^{1/3}$ and $\varepsilon \propto a^{-6}$ in agreement with the ultra-hard equation of state. Note that the solution obtained is exact for a massless scalar field. According to (5.29) the derivative of the scalar field decays exponentially more quickly than the value of the scalar field itself. Therefore, the large initial value of $|\dot\varphi|$ is damped within a short time interval before the field $\varphi$ itself has changed significantly. The trajectory which begins at large $|\dot\varphi|$ goes up very sharply and meets the attractor. This substantially enlarges the set of initial conditions which lead to an inflationary stage.


Inflationary solution If a trajectory joins the attractor where it is flat, at $|\varphi| \gg 1$, then afterwards the solution describes a stage of accelerated expansion (recall that we work in Planckian units). To determine the attractor solution we assume that $d\dot\varphi/d\varphi \approx 0$ along its trajectory. It follows from (5.27) that

$$\dot\varphi_{atr} \approx -\frac{m}{\sqrt{12\pi}}, \tag{5.32}$$

and therefore

$$\varphi_{atr}(t) \simeq \varphi_i - \frac{m}{\sqrt{12\pi}}(t - t_i) \simeq \frac{m}{\sqrt{12\pi}}(t_f - t), \tag{5.33}$$

where $t_i$ is the time when the trajectory joins the attractor and $t_f$ is the moment when $\varphi$ formally vanishes. In reality, (5.33) fails well before the field $\varphi$ vanishes.
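
The convergence to the attractor can be illustrated numerically. The following is a minimal sketch, not part of the original text: it integrates (5.24) with $H$ from (5.25) for $V = \tfrac{1}{2}m^2\varphi^2$ using a hand-written fourth-order Runge-Kutta step, and returns the ratio of $\dot\varphi$ to the attractor value (5.32). The initial data, the value of $m$ and the step count are arbitrary illustrative choices.

```python
import math

SQRT_12PI = math.sqrt(12.0 * math.pi)

def rhs(phi, dphi, m):
    """Right-hand side of (5.24) with H from (5.25), V = m^2 phi^2 / 2."""
    hubble = math.sqrt(4.0 * math.pi / 3.0 * (dphi**2 + m**2 * phi**2))
    return dphi, -3.0 * hubble * dphi - m**2 * phi

def rk4_step(phi, dphi, m, dt):
    """One classical fourth-order Runge-Kutta step for the pair (phi, dphi)."""
    k1p, k1v = rhs(phi, dphi, m)
    k2p, k2v = rhs(phi + 0.5 * dt * k1p, dphi + 0.5 * dt * k1v, m)
    k3p, k3v = rhs(phi + 0.5 * dt * k2p, dphi + 0.5 * dt * k2v, m)
    k4p, k4v = rhs(phi + dt * k3p, dphi + dt * k3v, m)
    phi += dt * (k1p + 2.0 * k2p + 2.0 * k3p + k4p) / 6.0
    dphi += dt * (k1v + 2.0 * k2v + 2.0 * k3v + k4v) / 6.0
    return phi, dphi

def velocity_over_attractor(phi0=5.0, dphi0=0.0, m=1e-2, t_end=1.0, steps=2000):
    """Evolve for a time t_end/m; return (phi, dphi divided by -m/sqrt(12 pi))."""
    dt = t_end / m / steps
    phi, dphi = phi0, dphi0
    for _ in range(steps):
        phi, dphi = rk4_step(phi, dphi, m, dt)
    return phi, dphi / (-m / SQRT_12PI)

phi, ratio = velocity_over_attractor()
print("phi =", phi, " dphi/dphi_atr =", ratio)
```

Starting from rest at $\varphi = 5$, the ratio should settle close to unity within a time of order $m^{-1}$, while the field itself has barely moved.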


Problem 5.6 Calculate the corrections to the approximate attractor solution (5.32) and show that

$$\dot\varphi_{atr} = -\frac{m}{\sqrt{12\pi}}\left(1 - \frac{1}{2}\left(\sqrt{12\pi}\,\varphi\right)^{-2} + O\left(\left(\sqrt{12\pi}\,\varphi\right)^{-3}\right)\right). \tag{5.34}$$


The corrections to (5.32) become of order the leading term when $\varphi \sim O(1)$, that is, when the scalar field value drops to the Planckian value or, more precisely, to $\varphi \simeq 1/\sqrt{12\pi} \simeq 1/6$. Hence (5.33) is a good approximation only when the scalar field exceeds the Planckian value. This does not mean, however, that we require a theory of nonperturbative quantum gravity. Nonperturbative quantum gravity effects become relevant only if the curvature or the energy density reaches the Planckian values. However, even for very large values of the scalar field they can still remain in the sub-Planckian domain. In fact, considering a massive homogeneous field with negligible kinetic energy we infer that the energy density reaches the Planckian value for $\varphi \simeq m^{-1}$. Therefore, if $m \ll 1$, then for $m^{-1} > \varphi > 1$ we can safely disregard nonperturbative quantum gravity effects.

According to (5.33) the scalar field decreases linearly with time after joining the attractor. During the inflationary stage

$$p \approx -\varepsilon + m^2/12\pi.$$

So when the potential energy density $\sim m^2\varphi^2$, which dominates the total energy density, drops to $m^2$, inflation is over. At this time the scalar field is of order unity (in Planckian units).

Let us determine the time dependence of the scale factor during inflation. Substituting (5.33) into (5.25) and neglecting the kinetic term, we obtain a simple equation which is readily integrated to yield

$$a(t) \simeq a_f\exp\left(-\frac{m^2}{6}(t_f - t)^2\right) \simeq a_i\exp\left(\frac{H_i + H(t)}{2}(t - t_i)\right), \tag{5.35}$$

where $a_i$ and $H_i$ are the initial values of the scale factor and the Hubble parameter. Note that the Hubble constant $H(t) \simeq \sqrt{4\pi/3}\,m\varphi(t)$ also linearly decreases with time. It follows from (5.33) that inflation lasts for

$$\Delta t \simeq t_f - t_i \simeq \sqrt{12\pi}\,(\varphi_i/m). \tag{5.36}$$

During this time interval the scale factor increases

$$\frac{a_f}{a_i} \simeq \exp\left(2\pi\varphi_i^2\right) \tag{5.37}$$

times. The results obtained are in good agreement with the previous rough estimates (5.18) and (5.19). Inflation lasts more than 75 e-folds if the initial value of the scalar field, $\varphi_i$, is four times larger than the Planckian value. To obtain an estimate for the largest possible increase of the scale factor during inflation, let us consider a scalar field of mass $10^{13}$ GeV. The maximal possible value of the scalar field for which we still remain in the sub-Planckian domain is $\varphi_i \sim 10^{6}$, and hence

$$\left(\frac{a_f}{a_i}\right)_{max} \sim \exp\left(10^{12}\right). \tag{5.38}$$

Thus, the actual duration of the inflationary stage can massively exceed the 75 e-folds needed. In this case our universe would constitute only a very tiny piece of an incredibly large homogeneous domain which originated from one causal region. The other important feature of inflation is that the Hubble constant decreases only by a factor $10^{-6}$, while the scale factor grows by the tremendous amount given in (5.38), that is,

$$\frac{H_i}{H_f} \ll \frac{a_f}{a_i}.$$
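
As a quick numerical illustration of (5.36)-(5.38), the sketch below evaluates the number of e-folds $2\pi\varphi_i^2$, the initial field value needed for a given number of e-folds, and the maximal number of e-folds for a given mass. The function names are illustrative, and taking $m \approx 10^{-6}$ for a mass of $10^{13}$ GeV in Planckian units is an order-of-magnitude assumption.

```python
import math

def efolds(phi_i):
    """Number of e-folds ln(a_f/a_i) = 2*pi*phi_i^2 from (5.37)."""
    return 2.0 * math.pi * phi_i**2

def phi_for_efolds(n_efolds):
    """Initial field value (Planckian units) giving n_efolds, inverting (5.37)."""
    return math.sqrt(n_efolds / (2.0 * math.pi))

def duration(phi_i, m):
    """Duration of inflation from (5.36), in Planckian units."""
    return math.sqrt(12.0 * math.pi) * phi_i / m

def max_efolds(m):
    """E-folds for the largest sub-Planckian field value phi_i ~ 1/m."""
    return efolds(1.0 / m)

print("phi_i for 75 e-folds:", phi_for_efolds(75.0))
print("e-folds for phi_i = 4:", efolds(4.0))
print("maximal e-folds for m = 1e-6:", max_efolds(1e-6))
```

The required initial value is about 3.5, which the text rounds to four Planckian units, and the maximal number of e-folds for $m \sim 10^{-6}$ comes out between $10^{12}$ and $10^{13}$.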


Graceful exit and afterwards After the field drops below the Planckian value it begins to oscillate. To determine the attractor behavior in this regime we note that

$$\dot\varphi^2 + m^2\varphi^2 = \frac{3}{4\pi}H^2 \tag{5.39}$$

and use the Hubble parameter $H$ and the angular variable $\theta$, defined via

$$\dot\varphi = \sqrt{\frac{3}{4\pi}}\,H\sin\theta, \qquad m\varphi = \sqrt{\frac{3}{4\pi}}\,H\cos\theta, \tag{5.40}$$

as the new independent variables. It is convenient to replace (5.27) by a system of two first order differential equations for $H$ and $\theta$:

$$\dot H = -3H^2\sin^2\theta, \tag{5.41}$$

$$\dot\theta = -m - \frac{3}{2}H\sin 2\theta, \tag{5.42}$$

where a dot denotes the derivative with respect to physical time $t$. The second term on the right hand side in (5.42) describes oscillations with decaying amplitude, as is evident from (5.41). Therefore, neglecting this term we obtain

$$\theta \simeq -mt + \alpha, \tag{5.43}$$

where the constant phase $\alpha$ can be set to zero. Thus, the scalar field oscillates with frequency $\omega \simeq m$. After substituting $\theta \simeq -mt$ into (5.41), we obtain a readily integrated equation with solution

$$H(t) \equiv \left(\frac{\dot a}{a}\right) \simeq \frac{2}{3t}\left(1 - \frac{\sin(2mt)}{2mt}\right)^{-1}, \tag{5.44}$$

where a constant of integration is removed by a time shift. This solution is applicable only for $mt \gg 1$. Therefore the oscillating term is small compared to unity and the expression on the right hand side in (5.44) can be expanded in powers of $(mt)^{-1}$. Substituting (5.43) and (5.44) into the second equation in (5.40), we obtain

$$\varphi(t) \simeq \frac{\cos(mt)}{\sqrt{3\pi}\,mt}\left(1 + \frac{\sin(2mt)}{2mt}\right) + O\left((mt)^{-3}\right). \tag{5.45}$$

The time dependence of the scale factor can easily be derived by integrating (5.44):

$$a \propto t^{2/3}\left(1 - \frac{\cos(2mt)}{6m^2t^2} - \frac{1}{24m^2t^2} + O\left((mt)^{-3}\right)\right). \tag{5.46}$$

Thus, in the leading approximation (up to decaying oscillating corrections), the universe expands like a matter-dominated universe with zero pressure. This is not surprising because an oscillating homogeneous field can be thought of as a condensate of massive scalar particles with zero momenta. Although the oscillating corrections are completely negligible in the expressions for $a(t)$ and $H(t)$, they must nevertheless be taken into account when we calculate the curvature invariants. For example, the scalar curvature is

$$R \simeq -\frac{4}{3t^2}\left(1 + 3\cos(2mt) + O\left((mt)^{-1}\right)\right) \tag{5.47}$$

(compare to $R = -4/3t^2$ in a matter-dominated universe).
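
The matter-like behavior $H \simeq 2/3t$ can be checked by integrating the system (5.41), (5.42) directly, without dropping the second term in (5.42). The sketch below is only an illustration under assumed initial data ($H = 0.5m$, $\theta = 0$ at $t = 0$); it uses a hand-written Runge-Kutta step and returns the product $Ht$ at late times.

```python
import math

def derivs(hubble, theta, m):
    """Right-hand sides of (5.41) and (5.42)."""
    return (-3.0 * hubble**2 * math.sin(theta)**2,
            -m - 1.5 * hubble * math.sin(2.0 * theta))

def hubble_times_t(m=1.0, hubble0=0.5, theta0=0.0, t_end=400.0, steps=40000):
    """Integrate (5.41), (5.42) with RK4 from t = 0 and return H(t_end) * t_end."""
    dt = t_end / steps
    h, th = hubble0, theta0
    for _ in range(steps):
        k1h, k1t = derivs(h, th, m)
        k2h, k2t = derivs(h + 0.5 * dt * k1h, th + 0.5 * dt * k1t, m)
        k3h, k3t = derivs(h + 0.5 * dt * k2h, th + 0.5 * dt * k2t, m)
        k4h, k4t = derivs(h + dt * k3h, th + dt * k3t, m)
        h += dt * (k1h + 2.0 * k2h + 2.0 * k3h + k4h) / 6.0
        th += dt * (k1t + 2.0 * k2t + 2.0 * k3t + k4t) / 6.0
    return h * t_end

print("H * t at m t = 400:", hubble_times_t())
```

For $mt \gg 1$ the product should approach $2/3$, up to a small offset from the arbitrary time origin and the decaying oscillating term in (5.44).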


We have shown that inflation with a smooth graceful exit occurs naturally in 
models with classical massive scalar fields. If the mass is small compared to the 
Planck mass, the inflationary stage lasts long enough and is followed by a cold- 
matter-dominated stage. This cold matter, consisting of heavy scalar particles, must 
finally be converted to radiation, baryons and leptons. We will see later that this 
can easily be achieved in a variety of ways.


5.4.2 General potential: slow-roll approximation


Equation (5.24) for a massive scalar field in an expanding universe coincides with the equation for a harmonic oscillator with a friction term proportional to the Hubble parameter $H$. It is well known that a large friction damps the initial velocities and enforces a slow-roll regime in which the acceleration can be neglected compared to the friction term. Because for a general potential $H \propto \sqrt{\varepsilon} \sim \sqrt{V}$, we expect that for large values of $V$ the friction term can also lead to a slow-roll inflationary stage, where $\ddot\varphi$ is negligible compared to $3H\dot\varphi$. Omitting the $\ddot\varphi$ term and assuming that $\dot\varphi^2 \ll V$, (5.24) and (5.25) simplify to

$$3H\dot\varphi + V_{,\varphi} \simeq 0, \qquad H \equiv \left(\frac{d\ln a}{dt}\right) \simeq \sqrt{\frac{8\pi}{3}V}. \tag{5.48}$$

Taking into account that

$$\frac{d\ln a}{dt} = \dot\varphi\,\frac{d\ln a}{d\varphi} \simeq -\frac{V_{,\varphi}}{3H}\frac{d\ln a}{d\varphi},$$

equations (5.48) give

$$-V_{,\varphi}\frac{d\ln a}{d\varphi} \simeq 8\pi V \tag{5.49}$$

and hence

$$a(\varphi) \simeq a_i\exp\left(8\pi\int_\varphi^{\varphi_i}\frac{V}{V_{,\varphi}}\,d\varphi\right). \tag{5.50}$$


This approximate solution is valid only if the slow-roll conditions

$$|\dot\varphi^2| \ll |V|, \qquad |\ddot\varphi| \ll 3H|\dot\varphi| \sim |V_{,\varphi}|, \tag{5.51}$$

used to simplify (5.24) and (5.25), are satisfied. With the help of (5.48), they can easily be recast in terms of requirements on the derivatives of the potential itself:

$$\left(\frac{V_{,\varphi}}{V}\right)^2 \ll 1, \qquad \left|\frac{V_{,\varphi\varphi}}{V}\right| \ll 1. \tag{5.52}$$

For a power-law potential, $V = (1/n)\lambda\varphi^n$, both conditions are satisfied for $|\varphi| \gg 1$. In this case the scale factor changes as

$$a(\varphi(t)) \simeq a_i\exp\left(\frac{4\pi}{n}\left(\varphi_i^2 - \varphi^2(t)\right)\right). \tag{5.53}$$

It is obvious that the bulk of the inflationary expansion takes place when the scalar field decreases by a factor of a few from its initial value. However, we are interested mainly in the last 50-70 e-folds of inflation because they determine the structure of the universe on present observable scales. The detailed picture of the expansion during these last 70 e-folds depends on the shape of the potential only within a rather narrow interval of scalar field values.
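
The slow-roll result (5.50) is easy to evaluate for any potential by numerical quadrature. The sketch below is an illustration only: it applies a midpoint rule to the integral in (5.50) and compares the outcome with the closed form (5.53) for a power-law potential. The function names and the sample values of $n$, $\lambda$ and the field range are assumptions made for the example.

```python
import math

def efolds_slow_roll(V, dV, phi_i, phi_f, steps=10000):
    """ln(a_f/a_i) = 8*pi * integral from phi_f to phi_i of V/V' dphi, see (5.50).

    V and dV are callables returning the potential and its derivative;
    the integral is approximated with the midpoint rule.
    """
    h = (phi_i - phi_f) / steps
    total = 0.0
    for j in range(steps):
        phi = phi_f + (j + 0.5) * h
        total += V(phi) / dV(phi)
    return 8.0 * math.pi * total * h

def efolds_power_law(n, phi_i, phi_f):
    """Closed form (5.53) for V = (1/n) * lambda * phi^n."""
    return 4.0 * math.pi / n * (phi_i**2 - phi_f**2)

lam, n = 1e-13, 4
numeric = efolds_slow_roll(lambda p: lam * p**n / n,
                           lambda p: lam * p**(n - 1),
                           phi_i=5.0, phi_f=1.0)
print("numerical:", numeric, " closed form:", efolds_power_law(n, 5.0, 1.0))
```

Note that the coupling $\lambda$ cancels in $V/V_{,\varphi}$, so within the slow-roll approximation the number of e-folds depends only on $n$ and on the field range.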


Problem 5.7 Find the time dependence of the scale factor for the power-law po- 
tential and estimate the duration of inflation. 


Problem 5.8 Verify that for a general potential $V$ the system of equations (5.24), (5.25) can be reduced to the following first order differential equation:

$$\frac{dy}{dx} = -3\left(1 - y^2\right)\left(1 + \frac{V_{,x}}{6yV}\right), \tag{5.54}$$

where

$$x \equiv \sqrt{\frac{4\pi}{3}}\,\varphi, \qquad y \equiv \sqrt{\frac{4\pi}{3}}\,\frac{d\varphi}{d\ln a}.$$

Assuming that $V_{,\varphi}/V \to 0$ as $|\varphi| \to \infty$, draw the phase diagram and analyze the behavior of the solutions in different asymptotic regions. Consider separately the case of the exponential potential. What is the physical meaning of the solutions in the regions corresponding to $y > 1$?





After the end of inflation the scalar field begins to oscillate and the universe enters the stage of deceleration. Assuming that the period of oscillation is smaller than the cosmological time, let us determine the effective equation of state. Neglecting the expansion and multiplying (5.24) by $\varphi$, we obtain

$$(\varphi\dot\varphi)^{\cdot} - \dot\varphi^2 + \varphi V_{,\varphi} \simeq 0. \tag{5.55}$$

As a result of averaging over a period, the first term vanishes and hence $\langle\dot\varphi^2\rangle \simeq \langle\varphi V_{,\varphi}\rangle$. Thus, the averaged effective equation of state for an oscillating scalar field is

$$w \equiv \frac{p}{\varepsilon} \simeq \frac{\langle\varphi V_{,\varphi}\rangle - \langle 2V\rangle}{\langle\varphi V_{,\varphi}\rangle + \langle 2V\rangle}. \tag{5.56}$$

It follows that for $V \propto \varphi^n$ we have $w \simeq (n - 2)/(n + 2)$. For an oscillating massive field ($n = 2$) we obtain $w \simeq 0$ in agreement with our previous result. In the case of a quartic potential ($n = 4$), the oscillating field mimics an ultra-relativistic fluid with $w \simeq 1/3$.
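
The averaged equation of state can also be checked by brute force. The following sketch, which is not from the original text, integrates $\ddot\varphi + V_{,\varphi} = 0$ (expansion neglected) for $V = \varphi^n/n$ with even $n$ and time-averages the pressure and energy density over many oscillations. The normalization of the potential, the initial amplitude and the integration length are arbitrary choices.

```python
def average_w(n, phi0=1.0, t_end=200.0, steps=40000):
    """Time-averaged p/eps for V = phi^n / n (n even), ignoring the expansion."""
    def acc(phi):
        # acceleration from phi'' = -V_phi
        return -phi**(n - 1)

    dt = t_end / steps
    phi, vel = phi0, 0.0
    sum_p = 0.0
    sum_eps = 0.0
    for _ in range(steps):
        # classical RK4 step for the pair (phi, vel)
        k1x, k1v = vel, acc(phi)
        k2x, k2v = vel + 0.5 * dt * k1v, acc(phi + 0.5 * dt * k1x)
        k3x, k3v = vel + 0.5 * dt * k2v, acc(phi + 0.5 * dt * k2x)
        k4x, k4v = vel + dt * k3v, acc(phi + dt * k3x)
        phi += dt * (k1x + 2.0 * k2x + 2.0 * k3x + k4x) / 6.0
        vel += dt * (k1v + 2.0 * k2v + 2.0 * k3v + k4v) / 6.0
        potential = phi**n / n
        sum_p += 0.5 * vel**2 - potential      # pressure, as in (5.23)
        sum_eps += 0.5 * vel**2 + potential    # energy density, as in (5.22)
    return sum_p / sum_eps

for n in (2, 4, 6):
    print(n, average_w(n), (n - 2) / (n + 2))
```

The printed averages should agree with $(n-2)/(n+2)$ up to a small residual from averaging over a non-integer number of periods.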

In fact, inflation can continue even after the end of slow-roll. Considering the potential which behaves as

$$V \sim \ln\left(|\varphi|/\varphi_c\right)$$

for $1 > |\varphi| \gg \varphi_c$ (see Figure 5.4), we infer from (5.56) that $w \to -1$. This is easy to understand. In the case of a convex potential, an oscillating scalar field spends most of the time near the potential walls where its kinetic energy is negligible and hence the main contribution to the equation of state comes from the potential term.


Problem 5.9 Which general conditions must a potential V satisfy to provide a 
stage of fast oscillating inflation? How long can such inflation last and why is it not 
very helpful for solving the initial conditions problem? 


5.5 Preheating and reheating 


The theory of reheating is far from complete. Not only the details, but even the over- 
all picture of inflaton decay depend crucially on the underlying particle physics 
theory beyond the Standard Model. Because there are so many possible exten- 
sions of the Standard Model, it does not make much sense to study the particulars 
of the reheating processes in each concrete model. Fortunately we are interested 
only in the final outcome of reheating, namely, in the possibility of obtaining a 
thermal Friedmann universe. Therefore, to illustrate the physical processes which 
could play a major role we consider only simple toy models. The relative impor- 
tance of the different reheating mechanisms cannot be clarified without an under- 
lying particle theory. However, we will show that all of them lead to the desired 
result.


5.5.1 Elementary theory


We consider an inflaton field $\varphi$ of mass $m$ coupled to a scalar field $\chi$ and a spinor field $\psi$. Their simplest interactions are described by three-legged diagrams (Figure 5.5), which correspond to the following terms in the Lagrangian:

$$\Delta L_{int} = -g\varphi\chi^2 - h\varphi\bar\psi\psi. \tag{5.57}$$

We have seen that these kinds of couplings naturally arise in gauge theories with spontaneously broken symmetry, and they are enough for our illustrative purposes. To avoid a tachyonic instability we assume that $|g\varphi|$ is smaller than the squared “bare” mass $m_\chi^2$. The decay rates of the inflaton field into $\chi\chi$ and $\bar\psi\psi$ pairs are determined by the coupling constants $g$ and $h$ respectively. They can easily be calculated and the corresponding results are cited in books on particle physics:

$$\Gamma_\chi \equiv \Gamma(\varphi \to \chi\chi) = \frac{g^2}{8\pi m}, \qquad \Gamma_\psi \equiv \Gamma(\varphi \to \psi\psi) = \frac{h^2 m}{8\pi}. \tag{5.58}$$
Let us apply these results in order to calculate the decay rate of the inflaton. As we have noted, an oscillating homogeneous scalar field can be interpreted as a condensate of heavy particles of mass $m$ “at rest,” that is, their 3-momenta $\mathbf{k}$ are equal to zero. Keeping only the leading term in (5.45), we have

$$\varphi(t) \simeq \Phi(t)\cos(mt), \tag{5.59}$$

where $\Phi(t)$ is the slowly decaying amplitude of oscillations. The number density of $\varphi$ particles can be estimated as

$$n_\varphi = \frac{\varepsilon_\varphi}{m} = \frac{1}{2m}\left(\dot\varphi^2 + m^2\varphi^2\right) \simeq \frac{1}{2}m\Phi^2. \tag{5.60}$$

This number is very large. For example, for $m \sim 10^{13}$ GeV, we have $n_\varphi \sim 10^{92}$ cm$^{-3}$ immediately after the end of inflation, when $\Phi \sim 1$ in Planckian units.

One can show that quantum corrections do not significantly modify the interactions (5.57) only if $g < m$ and $h < m^{1/2}$. Therefore, for $m \ll m_{Pl}$, the highest decay rate into $\chi$ particles, $\Gamma_\chi \sim m$, is much larger than the highest possible rate for the decay into fermions, $\Gamma_\psi \sim m^2$. If $g \sim m$, then the lifetime of a $\varphi$ particle is about $\Gamma_\chi^{-1} \sim m^{-1}$ and the inflaton decays after a few oscillations. Even if the coupling is not so large, the decay can still be very efficient. The reason is that the effective decay rate into bosons, $\Gamma_{eff}$, is equal to $\Gamma_\chi$, given in (5.58), only if the phase space of $\chi$ particles is not densely populated by previously created $\chi$ particles. Otherwise $\Gamma_{eff}$ can be made much larger by the effect of Bose condensation. This amplification of the inflaton decay is discussed in the next section.
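
To get a feeling for the numbers, the rates (5.58) can be evaluated for the largest couplings allowed by the bounds $g < m$ and $h < m^{1/2}$. The sketch below works in Planckian units; the function names and the sample mass $m = 10^{-6}$ are illustrative assumptions.

```python
import math

def gamma_chi(g, m):
    """Decay rate phi -> chi chi from (5.58)."""
    return g**2 / (8.0 * math.pi * m)

def gamma_psi(h, m):
    """Decay rate phi -> psi psi from (5.58)."""
    return h**2 * m / (8.0 * math.pi)

def oscillations_per_lifetime(gamma, m):
    """Lifetime 1/gamma measured in inflaton oscillation periods 2*pi/m."""
    return (1.0 / gamma) / (2.0 * math.pi / m)

m = 1e-6                           # inflaton mass in Planckian units (assumed)
rate_chi = gamma_chi(m, m)         # largest allowed coupling g ~ m
rate_psi = gamma_psi(math.sqrt(m), m)  # largest allowed coupling h ~ m^(1/2)
print("Gamma_chi / Gamma_psi:", rate_chi / rate_psi)
print("oscillations before decay for g = m:", oscillations_per_lifetime(rate_chi, m))
```

With these choices the bosonic channel is faster by a factor $1/m$, and for $g = m$ the lifetime corresponds to four oscillation periods, consistent with the statement that the inflaton decays after a few oscillations.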

Taking into account the expansion of the universe, the equations for the number densities of the $\varphi$ and $\chi$ particles can be written as

$$\frac{1}{a^3}\frac{d(a^3 n_\varphi)}{dt} = -\Gamma_{eff}\,n_\varphi, \qquad \frac{1}{a^3}\frac{d(a^3 n_\chi)}{dt} = 2\Gamma_{eff}\,n_\varphi, \tag{5.61}$$

where the coefficient 2 in the second equation arises because one $\varphi$ particle decays into two $\chi$ particles.


Problem 5.10 Substituting (5.60) into the first equation in (5.61), derive the approximate equation

$$\ddot\varphi + (3H + \Gamma_{eff})\dot\varphi + m^2\varphi \simeq 0, \tag{5.62}$$

which shows that the decay of the inflaton amplitude due to particle production may be roughly taken into account by introducing an extra friction term $\Gamma_{eff}\dot\varphi$. Why is this equation applicable only during the oscillatory phase?


5.5.2 Narrow resonance 


The domain of applicability of elementary reheating theory is limited. Bose condensation effects become important very soon after the beginning of the inflaton decay. Because the inflaton particle is “at rest,” the momenta of the two produced $\chi$ particles have the same magnitude $k$ but opposite directions. If the corresponding states in the phase space of $\chi$ particles are already occupied, then the inflaton decay rate is enhanced by a Bose factor. The inverse decay process $\chi\chi \to \varphi$ can also take place. The rates of these processes are proportional to

$$\left|\langle n_\varphi - 1, n_{\mathbf{k}} + 1, n_{-\mathbf{k}} + 1|\,\hat a^+_{\mathbf{k}}\hat a^+_{-\mathbf{k}}\hat a^-_\varphi\,|n_\varphi, n_{\mathbf{k}}, n_{-\mathbf{k}}\rangle\right|^2 = (n_{\mathbf{k}} + 1)(n_{-\mathbf{k}} + 1)\,n_\varphi$$

and

$$\left|\langle n_\varphi + 1, n_{\mathbf{k}} - 1, n_{-\mathbf{k}} - 1|\,\hat a^+_\varphi\hat a^-_{\mathbf{k}}\hat a^-_{-\mathbf{k}}\,|n_\varphi, n_{\mathbf{k}}, n_{-\mathbf{k}}\rangle\right|^2 = n_{\mathbf{k}}\,n_{-\mathbf{k}}\,(n_\varphi + 1)$$

respectively, where $\hat a^\pm_{\pm\mathbf{k}}$ are the creation and annihilation operators for $\chi$ particles and $n_{\pm\mathbf{k}}$ are their occupation numbers. To avoid confusion the reader must always distinguish the occupation numbers from the number densities keeping in mind that the occupation number refers to a density per cell of volume $(2\pi)^3$ (in the Planckian units) in the phase space, while the number density is the number of particles per unit volume in the three-dimensional space. Taking into account that $n_{\mathbf{k}} = n_{-\mathbf{k}} \equiv n_k$ and $n_\varphi \gg 1$, we infer that the number densities $n_\varphi$ and $n_\chi$ satisfy (5.61), where

$$\Gamma_{eff} \simeq \Gamma_\chi(1 + 2n_k). \tag{5.63}$$


Given a number density $n_\chi$, let us calculate $n_k$. A $\varphi$ particle “at rest” decays into two $\chi$ particles, both having energy $m/2$. Because of the interaction term (5.57), the effective squared mass of the $\chi$ particle depends on the value of the inflaton field and is equal to $m_\chi^2 + 2g\varphi(t)$. Therefore the corresponding 3-momentum of the produced $\chi$ particle is given by

$$k = \left(\left(\frac{m}{2}\right)^2 - m_\chi^2 - 2g\varphi(t)\right)^{1/2}, \tag{5.64}$$

where we assume that $m_\chi^2 + 2g\varphi \ll m^2$. The oscillating term,

$$g\varphi \simeq g\Phi\cos(mt),$$

leads to a “scattering” of the momenta in phase space. If $g\Phi \ll m^2/8$, then all particles are created within a thin shell of width

$$\Delta k \simeq m\left(\frac{4g\Phi}{m^2}\right) \ll m \tag{5.65}$$

located near the radius $k_0 \simeq m/2$ (Figure 5.6(a)). Therefore

$$n_{k=m/2} \simeq \frac{n_\chi}{\left(4\pi k_0^2\Delta k\right)/(2\pi)^3} = \frac{2\pi^2 n_\chi}{mg\Phi} = \frac{\pi^2\Phi}{g}\frac{n_\chi}{n_\varphi}. \tag{5.66}$$

Fig. 5.6. (a), (b)

The occupation numbers $n_k$ exceed unity, and hence the Bose condensation effect is essential only if

$$n_\chi > \frac{g}{\pi^2\Phi}\,n_\varphi. \tag{5.67}$$
Taking into account that at the end of inflation $\Phi \sim 1$, we infer that the occupation numbers begin to exceed unity as soon as the inflaton converts a fraction $g$ of its energy to $\chi$ particles. The derivation above is valid only for $g < m^2/8$. Therefore, if $m \sim 10^{-6}$, then at most a fraction $g \sim m^2 \sim 10^{-12}$ of the inflaton energy can be transferred to $\chi$ particles in the regime where $n_k < 1$. Thus, the elementary theory of reheating, which is applicable for $n_k \ll 1$, fails almost immediately after the beginning of reheating. Given the result in (5.66), the effective decay rate (5.63) becomes

$$\Gamma_{eff} \simeq \frac{g^2}{8\pi m}\left(1 + \frac{2\pi^2\Phi}{g}\frac{n_\chi}{n_\varphi}\right), \tag{5.68}$$

where we have used (5.58) for $\Gamma_\chi$. Substituting this expression into the second equation in (5.61), we obtain

$$\frac{1}{a^3}\frac{d(a^3 n_\chi)}{dN} = \frac{g^2}{2m^2}\left(1 + \frac{2\pi^2\Phi}{g}\frac{n_\chi}{n_\varphi}\right)n_\varphi, \tag{5.69}$$

where $N \equiv mt/2\pi$ is the number of inflaton oscillations. Let us neglect for a moment the expansion of the universe and disregard the decrease of the inflaton amplitude due to particle production. In this case $\Phi = \mathrm{const}$ and for $n_k \gg 1$ (5.69) can be easily integrated. The result is

$$n_\chi \propto \exp\left(\frac{\pi^2 g\Phi}{m^2}N\right) \propto \exp(2\pi\mu N), \tag{5.70}$$

where $\mu \equiv \pi g\Phi/(2m^2)$ is the parameter of instability.
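
The chain (5.63), (5.66), (5.68), (5.70) can be put into a few helper functions. The sketch below is illustrative only (the function names are made up here and all quantities are in Planckian units); the sample values $m = 10^{-6}$, $\Phi = 1$, $g = 10^{-14}$ are assumptions chosen so that $g\Phi \ll m^2/8$ holds.

```python
import math

def occupation_number(n_chi, n_phi, g, big_phi):
    """Occupation number in the resonance shell, (5.66)."""
    return math.pi**2 * big_phi / g * n_chi / n_phi

def gamma_eff(g, m, big_phi, n_chi, n_phi):
    """Bose-enhanced decay rate (5.68) = Gamma_chi * (1 + 2 n_k)."""
    rate_chi = g**2 / (8.0 * math.pi * m)
    return rate_chi * (1.0 + 2.0 * occupation_number(n_chi, n_phi, g, big_phi))

def instability_parameter(g, m, big_phi):
    """Parameter mu = pi g Phi / (2 m^2) defined after (5.70)."""
    return math.pi * g * big_phi / (2.0 * m**2)

def growth_factor(g, m, big_phi, n_oscillations):
    """Growth of n_chi over N oscillations, exp(2 pi mu N), valid for n_k >> 1."""
    return math.exp(2.0 * math.pi * instability_parameter(g, m, big_phi) * n_oscillations)

m, big_phi, g = 1e-6, 1.0, 1e-14
print("mu =", instability_parameter(g, m, big_phi))
print("growth over 100 oscillations:", growth_factor(g, m, big_phi, 100))
```

Even for this tiny coupling the number density grows by a factor of order $e^{10}$ over a hundred oscillations, provided the expansion and back-reaction are neglected as in the derivation of (5.70).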


Problem 5.11 Derive the following equation for the Fourier modes of the field $\chi$ in Minkowski space:

$$\ddot\chi_k + \left(k^2 + m_\chi^2 + 2g\Phi\cos mt\right)\chi_k = 0. \tag{5.71}$$

Reduce it to the well known Mathieu equation and, assuming that $m^2 \gg m_\chi^2 \geq 2|g\Phi|$, investigate the narrow parametric resonance. Determine the instability bands and the corresponding instability parameters. Compare the width of the first instability band with (5.65). Where is this band located? The minimal value of the initial amplitude of $\chi_k$ is due to vacuum fluctuations. The increase of $\chi_k$ with time can be interpreted as the production of $\chi$ particles by the external classical field $\varphi$, with $n_k \propto |\chi_k|^2$. Show that in the center of the first instability band,

$$n_k \propto \exp\left(\frac{4\pi g\Phi}{m^2}N\right), \tag{5.72}$$

where $N$ is the number of oscillations. Compare this result with (5.70) and explain why they are different by a numerical factor in the exponent. Thus, Bose condensation can be interpreted as a narrow parametric resonance in the first instability band, and vice versa. Give a physical interpretation of the higher-order resonance bands in terms of particle production.





Using the results of this problem we can reduce the investigation of the inflaton decay due to the coupling

$$\Delta L_{int} = -\tfrac{1}{2}\tilde g^2\varphi^2\chi^2, \tag{5.73}$$

to the case studied above. In fact, the equation for a massless scalar field $\chi$, coupled to the inflaton $\varphi = \Phi\cos mt$, takes the form

$$\ddot\chi_k + \left(k^2 + \tilde g^2\Phi^2\cos^2 mt\right)\chi_k = 0, \tag{5.74}$$

which coincides with (5.71) for $m_\chi^2 = 2g\Phi$ after the substitutions $\tilde g^2\Phi^2 \to 4g\Phi$ and $m \to m/2$. Thus, the two problems are mathematically equivalent. Using this observation and making the corresponding replacements in (5.72), we immediately find that

$$n_k \propto \exp\left(\frac{\pi\tilde g^2\Phi^2}{4m^2}N\right). \tag{5.75}$$

The condition for narrow resonance is $\tilde g\Phi \ll m$ and the width of the first resonance band can be estimated from (5.65) as $\Delta k \sim m\left(\tilde g^2\Phi^2/m^2\right)$.

In summary, we have shown that even for a small coupling constant the elemen- 
tary theory of reheating must be modified to take into account the Bose condensation 
effect, and that this can lead to an exponential increase of the reheating efficiency. 


Problem 5.12 Taking a few concrete values for g and m, compare the results of 
the elementary theory with those obtained for narrow parametric resonance. 


So far we have neglected the expansion of the universe, the back-reaction of the produced particles and their rescatterings. All these effects work to suppress the efficiency of the narrow parametric resonance. The expansion shifts the momenta of the previously created particles and takes them out of the resonance layer (Figure 5.6(a)). Thus, the occupation numbers relevant for Bose condensation are actually smaller than what one would expect according to the naive estimate (5.66). If the rate of supply of newly created particles in the resonance layer is smaller than the rate of their escape, then $n_k < 1$ and we can use the elementary theory of reheating. The other important effect is the decrease of the amplitude $\Phi(t)$ due to both the expansion of the universe and particle production. Because the width of the resonance layer is proportional to $\Phi$, it becomes more and more narrow. As a result the particles can escape from this layer more easily and they do not stimulate the subsequent production of particles. The rescattering of the $\chi$ particles also suppresses the resonance efficiency by removing particles from the resonance layer. Another effect is the change of the effective inflaton mass due to the newly produced $\chi$ particles; this shifts the center of the resonance layer from its original location.

To conclude, narrow parametric resonance is very sensitive to the interplay of 
different complicating factors. It can be fully investigated only using numerical 
methods. From our analytical consideration we can only say that the inflaton field 
probably decays not as “slowly” as in the elementary theory, but not as “fast” as in 
the case of pure narrow parametric resonance. 


5.5.3 Broad resonance 


So far we have considered only the case of a small coupling constant. Quantum corrections to the Lagrangian are not very crucial if $g < m$ and $\tilde g < (m/\Phi)^{1/2}$. They can therefore be ignored when we consider inflaton decay in the strong coupling regime: $m > g > m^2/\Phi$ for the three-leg interaction and $(m/\Phi)^{1/2} > \tilde g > m/\Phi$ for the quartic interaction (5.73). In this case the condition for narrow resonance is not fulfilled and we cannot use the methods above. Perturbative methods fail because the higher-order diagrams, built from the elementary diagrams, give comparable contributions. Particle production can be treated only as a collective effect in which many inflaton particles participate simultaneously. We have to apply the methods of quantum field theory in an external classical background, as in Problem 5.11.

Let us consider quartic interaction (5.73). First, we neglect the expansion of the universe. For $\tilde g\Phi \gg m$ the mode equation (see (5.74)):

$$\ddot\chi_k + \omega^2(t)\chi_k = 0, \tag{5.76}$$

where

$$\omega(t) \equiv \left(k^2 + \tilde g^2\Phi^2\cos^2 mt\right)^{1/2}, \tag{5.77}$$

describes a broad parametric resonance. If the frequency $\omega(t)$ is a slowly varying function of time or, more precisely, $|\dot\omega| \ll \omega^2$, (5.76) can be solved in the quasiclassical (WKB) approximation:

$$\chi_k \propto \frac{1}{\sqrt{\omega}}\exp\left(\pm i\int\omega\,dt\right). \tag{5.78}$$

In this case the number of particles, $n_\chi \approx \varepsilon_\chi/\omega$, is an adiabatic invariant and is conserved. For most of the time the condition $|\dot\omega| \ll \omega^2$ is indeed fulfilled. However, every time the oscillating inflaton vanishes at $t_j = m^{-1}(j + 1/2)\pi$, the effective mass of the $\chi$ field, proportional to $|\cos(mt)|$, vanishes. It is shortly before and after $t_j$ that the adiabatic condition is strongly violated:

$$\frac{|\dot\omega|}{\omega^2} = \frac{m\tilde g^2\Phi^2\,|\cos(mt)\sin(mt)|}{\left(k^2 + \tilde g^2\Phi^2\cos^2(mt)\right)^{3/2}} \geq 1. \tag{5.79}$$

Considering a small time interval At < m7! in the vicinity of t;, we can rewrite 
this condition as 





At/At, 
v2! (5.80) 
(k?Ar2 +(At/At,)”) 
where 
1 
At, (2m) 1? = —(@@/m)'. (5.81) 
m 


It follows that the adiabatic condition is broken only within short time intervals
$\Delta t \sim \Delta t_*$ near $t_j$ and only for modes with

$$k < k_* \simeq \Delta t_*^{-1} \simeq m(\tilde{g}\Phi/m)^{1/2}. \tag{5.82}$$

Therefore, we expect that $\chi$ particles with the corresponding momenta are created
only during these time intervals. It is worth noting that the momentum of the
created particle can be larger than the inflaton mass by the ratio $(\tilde{g}\Phi/m)^{1/2} > 1$; the
$\chi$ particles are produced as a result of a collective process involving many inflaton
particles. This is the reason why we cannot describe the broad resonance regime
using the usual methods of perturbation theory.
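
The order-of-magnitude bound (5.82) can be checked numerically. The Python sketch below evaluates the left-hand side of (5.79) over the half period of the inflaton oscillation centred on a zero $t_j$ and records its maximum for a given ratio $k/k_*$. The function names and the illustrative values $m = 1$, $\tilde{g}\Phi = 100$ are assumptions made for this example only.

```python
import math

def adiabaticity_ratio(t, k, m, g_phi):
    """|d(omega)/dt| / omega^2 for omega^2 = k^2 + (g_phi * cos(m t))^2, cf. (5.79)."""
    c, s = math.cos(m * t), math.sin(m * t)
    return m * g_phi**2 * abs(c * s) / (k * k + (g_phi * c) ** 2) ** 1.5

def max_ratio(kappa, m=1.0, g_phi=100.0, samples=20001):
    """Largest ratio over the half period centred on the inflaton zero t_j = pi/(2m).

    kappa is k/k_* with k_* = m (g_phi/m)^(1/2); it must be positive.
    """
    k_star = math.sqrt(g_phi * m)
    t_j = 0.5 * math.pi / m
    half = 0.5 * math.pi / m
    best = 0.0
    for i in range(samples):
        t = t_j - half + 2.0 * half * i / (samples - 1)
        best = max(best, adiabaticity_ratio(t, kappa * k_star, m, g_phi))
    return best

if __name__ == "__main__":
    for kappa in (0.3, 0.6, 1.0, 3.0):
        print(kappa, max_ratio(kappa))
```

In the linearized form (5.80) the maximum over $\Delta t$ equals $2/(3\sqrt{3}\,k^2\Delta t_*^2)$, so with these assumptions the ratio exceeds unity only for $k \lesssim 0.6\,k_*$, in line with the rough bound $k < k_*$.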

To calculate the number of particles produced in a single inflaton oscillation we
consider a short time interval in the vicinity of $t_j$ and approximate the cosine in
(5.76) by a linear function. Equation (5.76) then takes the form

$$\frac{d^2\chi_\kappa}{d\tau^2} + (\kappa^2 + \tau^2)\,\chi_\kappa = 0, \tag{5.83}$$

where the dimensionless wavenumber $\kappa \equiv k/k_*$ and time $\tau \equiv (t - t_j)/\Delta t_*$ have
been introduced. In terms of the new variables the adiabaticity condition is broken
at $|\tau| < 1$ and only for $\kappa < 1$. It is remarkable that the coupling constant $\tilde{g}$, the
mass and the amplitude of the inflaton enter explicitly only in $\kappa^2$. The adiabaticity
violation is largest for $k = 0$. In this case the parameters $\tilde{g}$, $\Phi$ and $m$ drop from
(5.83) and the amplitude $\chi_{\kappa=0}$ changes only by a numerical, parameter-independent
factor as a result of passing through the nonadiabatic region at $|\tau| < 1$. Because
the particle density $n$ is proportional to $|\chi|^2$, its growth from one oscillation to the
next can be written as

$$\left(\frac{n^{j+1}}{n^{j}}\right)_{k=0} = \exp(2\pi\mu_{k=0}), \tag{5.84}$$

where the instability parameter $\mu_{k=0}$ does not depend on $\tilde{g}$, $\Phi$ and $m$. For modes with
$k \neq 0$, the parameter $\mu_{k\neq 0}$ is a function of $\kappa = k/k_*$. In this case the adiabaticity
is not violated as strongly as for the $k = 0$ mode, and hence $\mu_{k\neq 0}$ is smaller than
$\mu_{k=0}$. To calculate the instability parameters we have to determine the change of
the amplitude $\chi$ in passing from the $\tau \ll -1$ region to the $\tau \gg 1$ region. This can
be done using two independent WKB solutions of (5.83) in the asymptotic regions
$|\tau| \gg 1$:


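$$\chi_\pm = \frac{1}{(\kappa^2 + \tau^2)^{1/4}}\exp\left(\pm i\int\sqrt{\kappa^2 + \tau^2}\,d\tau\right) \propto |\tau|^{-\frac{1}{2} \pm \frac{1}{2}i\kappa^2}\exp\left(\pm\frac{i\tau^2}{2}\right). \tag{5.85}$$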


After passing through the nonadiabatic region the mode $A_+\chi_+$ becomes a mixture
of the modes $\chi_+$ and $\chi_-$, that is,

$$A_+\chi_+ \to B_+\chi_+ + C_+\chi_-, \tag{5.86}$$

where $A_+$, $B_+$ and $C_+$ are complex constant coefficients. Similarly, for the
mode $A_-\chi_-$, we have

$$A_-\chi_- \to B_-\chi_- + C_-\chi_+. \tag{5.87}$$





Drawing an analogy with the scattering problem for the inverse parabolic potential, 
we note that the mixture arises due to an overbarrier reflection of the wave. The 
reflection is most efficient for the waves with k = 0 which “touch” the top of the 
barrier. 

The quasi-classical solution is valid in the complex plane for $|\tau| \gg 1$. Traversing
the appropriate contour $\tau = |\tau|e^{i\varphi}$ in the complex plane from $\tau \ll -1$ to $\tau \gg 1$,
we infer from (5.85), (5.86) and (5.87) that

$$B_\pm = \mp i\,e^{-\frac{\pi}{2}\kappa^2} A_\pm. \tag{5.88}$$








The coefficients $C_\pm$ are not determined in this method. To find them we use the
Wronskian

$$W \equiv \dot{\chi}\chi^* - \chi\dot{\chi}^*, \tag{5.89}$$

where $\chi$ is an arbitrary complex solution of (5.83). Taking the derivative of $W$ and
using (5.83) to express $\ddot{\chi}$ in terms of $\chi$, we find

$$\dot{W} = 0 \tag{5.90}$$


and hence $W = \text{const}$. From this we infer that the coefficients $A$, $B$ and $C$ in (5.86)
and (5.87) satisfy the “probability conservation” condition

$$|C_\pm|^2 - |B_\pm|^2 = |A_\pm|^2. \tag{5.91}$$

Substituting $B$ from (5.88), we obtain

$$C_\pm = \sqrt{1 + e^{-\pi\kappa^2}}\,|A_\pm|\,e^{i\alpha_\pm}, \tag{5.92}$$

where the phases $\alpha_\pm$ remain undetermined.

At $|\tau| \gg 1$ the modes of field $\chi$ satisfy the harmonic oscillator equation with a
slowly changing frequency $\omega \propto |\tau|$. In quantum field theory the occupation number
$n_\kappa$ in the expression for the energy of the harmonic oscillator,

$$\varepsilon_\kappa = \omega(n_\kappa + 1/2), \tag{5.93}$$

is interpreted as the number of particles in the corresponding mode $\kappa$. In the adiabatic
regime ($|\tau| \gg 1$) this number is conserved and it changes only when the adiabatic
condition is violated. Let us consider an arbitrary initial mixture of the modes $\chi_+$
and $\chi_-$. After passing through the nonadiabatic region at $t \sim t_j$, it changes as

$$\chi^{(j)} = A_+\chi_+ + A_-\chi_- \to \chi^{(j+1)} = (B_+ + C_-)\chi_+ + (B_- + C_+)\chi_-. \tag{5.94}$$





Taking into account that

$$\varepsilon_\kappa = \frac{1}{2}\left(|\dot{\chi}_\kappa|^2 + \omega^2|\chi_\kappa|^2\right), \tag{5.95}$$

we see that as a result of this passage the number of particles in the mode $\kappa$ increases

$$\left(\frac{n^{j+1} + 1/2}{n^{j} + 1/2}\right)_\kappa \simeq \frac{|B_+ + C_-|^2 + |B_- + C_+|^2}{|A_+|^2 + |A_-|^2} \tag{5.96}$$

times, where we have averaged $|\chi|^2$ over the time interval $m^{-1} > t > \omega^{-1}$. With
$B$ and $C$ from (5.88) and (5.92), this expression becomes

$$\left(\frac{n^{j+1} + 1/2}{n^{j} + 1/2}\right)_\kappa \simeq \left(1 + 2e^{-\pi\kappa^2}\right) + \frac{4|A_-||A_+|}{|A_+|^2 + |A_-|^2}\cos\theta\; e^{-\frac{\pi}{2}\kappa^2}\sqrt{1 + e^{-\pi\kappa^2}}. \tag{5.97}$$

Problem 5.13 Verify (5.97) and explain the origin of the phase $\theta$. (Hint: Derive
and use the relation $\mathrm{Re}\,B_+C_-^* = \mathrm{Re}\,B_-C_+^*$, which follows from the “probability
conservation” condition for (5.94).)
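
As a numerical cross-check of the first passage through the nonadiabatic region, one can integrate (5.83) directly. For a single initial mode with $n = 0$, (5.97) predicts that $n + 1/2$ grows by the factor $1 + 2e^{-\pi\kappa^2}$, that is, $n = e^{-\pi\kappa^2}$ afterwards. The sketch below is an illustration under stated assumptions: it uses a hand-written fourth-order Runge-Kutta step, starts from the lowest-order WKB mode at $\tau = -20$, and reads off $n$ from the oscillator energy as in (5.93). The function names, the integration range and the step size are choices made here, not taken from the text.

```python
import math

def omega(tau, kappa):
    """Frequency of the rescaled mode equation chi'' + (kappa^2 + tau^2) chi = 0."""
    return math.sqrt(kappa * kappa + tau * tau)

def particles_after_passage(kappa, t_max=20.0, h=0.002):
    """Integrate from -t_max to +t_max with RK4 and return the occupation number n."""
    tau = -t_max
    w = omega(tau, kappa)
    chi = complex(1.0 / math.sqrt(2.0 * w), 0.0)   # normalised so that n = 0 initially
    dchi = -1j * w * chi                           # lowest-order WKB derivative

    def acc(t, x):
        return -(kappa * kappa + t * t) * x

    for _ in range(int(round(2.0 * t_max / h))):
        k1x, k1v = dchi, acc(tau, chi)
        k2x, k2v = dchi + 0.5 * h * k1v, acc(tau + 0.5 * h, chi + 0.5 * h * k1x)
        k3x, k3v = dchi + 0.5 * h * k2v, acc(tau + 0.5 * h, chi + 0.5 * h * k2x)
        k4x, k4v = dchi + h * k3v, acc(tau + h, chi + h * k3x)
        chi += h * (k1x + 2.0 * k2x + 2.0 * k3x + k4x) / 6.0
        dchi += h * (k1v + 2.0 * k2v + 2.0 * k3v + k4v) / 6.0
        tau += h

    w = omega(tau, kappa)
    energy = 0.5 * (abs(dchi) ** 2 + w * w * abs(chi) ** 2)
    return energy / w - 0.5                        # energy = omega * (n + 1/2)

if __name__ == "__main__":
    for kappa in (0.0, 0.5, 1.0):
        print(kappa, particles_after_passage(kappa), math.exp(-math.pi * kappa**2))
```

Because the initial and final states are only lowest-order adiabatic, the agreement is expected at the level of $|\tau|^{-2}$ corrections rather than exactly.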


In the vacuum initial state $n_\kappa = 0$ but the amplitude of the field $\chi_\kappa$ does not
vanish because of the existence of vacuum fluctuations; we have $|A_+^0|^2 \neq 0$ and
$|A_-^0|^2 = 0$. It follows from the “probability conservation” condition that

$$|A_+|^2 - |A_-|^2 = |A_+^0|^2 \tag{5.98}$$

at every moment of time. This means that as a result of particle production the
coefficients $|A_+|^2$ and $|A_-|^2$ grow by the same amount. When $|A_+|$ becomes much
larger than $|A_+^0|$ we have $|A_+| \approx |A_-|$. Taking this into account and beginning in
the vacuum state, we find from (5.97) that after $N \gg 1$ inflaton oscillations the
particle number in mode $\kappa$ is

$$n_\kappa \simeq \frac{1}{2}\exp(2\pi\mu_\kappa N), \tag{5.99}$$

where the instability parameter is given by

$$\mu_\kappa \simeq \frac{1}{2\pi}\ln\left(1 + 2e^{-\pi\kappa^2} + 2\cos\theta\,e^{-\frac{\pi}{2}\kappa^2}\sqrt{1 + e^{-\pi\kappa^2}}\right). \tag{5.100}$$


This parameter takes its maximal value

$$\mu^{\max} = \pi^{-1}\ln\left(1 + \sqrt{2}\right) \simeq 0.28$$

for $\kappa = 0$ and $\theta = 0$. In the interval $-\pi < \theta < \pi$ we find that $\mu_{\kappa=0}$ is positive if
$3\pi/4 > \theta > -3\pi/4$ and negative otherwise. Thus, assuming random $\theta$, we conclude
that the particle number in every mode changes stochastically. However, if all
$\theta$ are equally probable, then the number of particles increases three quarters of the
time and therefore it also increases on average, in agreement with entropic arguments.
The net instability parameter, characterizing the average growth in particle
number, is obtained by skipping the $\cos\theta$ term in (5.100):

$$\mu_\kappa \simeq \frac{1}{2\pi}\ln\left(1 + 2e^{-\pi\kappa^2}\right). \tag{5.101}$$
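
The numbers quoted for the instability parameter are easy to reproduce. The following Python sketch, with function names chosen here for illustration, evaluates (5.100) and (5.101), and samples the phase $\theta$ uniformly in $(-\pi, \pi)$ to find the fraction of phases for which the particle number grows.

```python
import math

def mu(kappa, theta):
    """Instability parameter (5.100) for rescaled momentum kappa and phase theta."""
    e = math.exp(-math.pi * kappa * kappa)
    growth = 1.0 + 2.0 * e + 2.0 * math.cos(theta) * math.sqrt(e * (1.0 + e))
    return math.log(growth) / (2.0 * math.pi)

def mu_net(kappa):
    """Net instability parameter (5.101), i.e. (5.100) without the cos(theta) term."""
    return math.log(1.0 + 2.0 * math.exp(-math.pi * kappa * kappa)) / (2.0 * math.pi)

def growing_fraction(kappa, samples=100000):
    """Fraction of uniformly distributed phases for which mu > 0."""
    hits = 0
    for i in range(samples):
        theta = -math.pi + 2.0 * math.pi * (i + 0.5) / samples
        if mu(kappa, theta) > 0.0:
            hits += 1
    return hits / samples

if __name__ == "__main__":
    print(mu(0.0, 0.0))            # maximal value, about 0.28
    print(mu_net(0.0))             # (ln 3)/(2 pi), about 0.175
    print(growing_fraction(0.0))   # three quarters of the phases
```

Dropping the $\cos\theta$ term amounts to averaging the growth factor $\exp(2\pi\mu_\kappa)$, rather than $\mu_\kappa$ itself, over a uniformly distributed phase.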


With slight modifications the results above can be applied to an expanding universe.
First of all we note that the expansion randomizes the phases $\theta$ and hence
the effective instability parameter is given by (5.101). For particles with physical
momenta $k < k_*/\sqrt{\pi}$, the instability parameter $\mu_\kappa$ can be roughly estimated
by its value at the center of the instability region, $\mu_{\kappa=0} = (\ln 3)/2\pi \simeq 0.175$. To
understand how the expansion can influence the efficiency of broad resonance, it
is again helpful to use the phase space picture. The particles created in the broad
resonance regime occupy the entire sphere of radius $k_*/\sqrt{\pi}$ in phase space (see
Figure 5.6(b)). During the passage through the nonadiabatic region the number of
particles in every cell of the sphere, and hence the total number density, increases on
average $\exp(2\pi \times 0.175) \simeq 3$ times. At the stage when inflaton energy is still dominant,
the physical momentum of the created particle decreases in inverse proportion
to the scale factor ($k \propto a^{-1}$), while the radius of the sphere shrinks more slowly,
namely, as $\Phi^{1/2} \propto t^{-1/2} \propto a^{-3/4}$. As a result, the created particles move away from
the boundary of the sphere towards its center where they participate in the next “act
of creation,” enhancing the probability by a Bose factor. Furthermore, expansion
also makes broad resonance less sensitive to rescattering and back-reaction effects. 
These two effects influence the resonance efficiency by removing those particles 
which are located near the boundary of the resonance sphere. Because expansion 
moves particles away from this region, the impact of these effects is diminished. 
Thus, in contrast to the narrow resonance case, expansion stabilizes broad resonance 
and at the beginning of reheating it can be realized in its pure form. 
Taking into account that the initial volume of the resonance sphere is about

$$k_*^3 \simeq m^3(\tilde{g}\Phi_0/m)^{3/2},$$

we obtain the following estimate for the ratio of the particle number densities after
$N$ inflaton oscillations:

$$\frac{n_\chi}{n_\varphi} \sim \frac{k_*^3\exp(2\pi\mu_{\kappa=0}N)}{m\Phi_0^2} \sim m^{1/2}\tilde{g}^{3/2}\,3^N, \tag{5.102}$$

where $\Phi_0 \sim O(1)$ is the value of the inflaton amplitude after the end of inflation.
Since in the adiabatic regime the effective mass of the $\chi$ particles is of order $\tilde{g}\Phi$,
where $\Phi$ decreases in inverse proportion to $N$, we also obtain an estimate for the
ratio of the energy densities:

$$\frac{\varepsilon_\chi}{\varepsilon_\varphi} \sim \frac{m_\chi n_\chi}{m\,n_\varphi} \sim m^{-1/2}\tilde{g}^{5/2}N^{-1}\,3^N. \tag{5.103}$$

The formulae above fail when the energy density of the created particles begins 
to exceed the energy density stored in the inflaton field. In fact, at this time, the 
amplitude $\Phi(t)$ begins to decrease very quickly because of the very efficient energy
transfer from the inflaton to the $\chi$ particles. Broad resonance is certainly over when
$\Phi(t)$ drops to the value $\Phi_r \sim m/\tilde{g}$, and we enter the narrow resonance regime. For
the coupling constant $\sqrt{m} > \tilde{g} > O(1)\,m$, the number of inflaton oscillations
$N_r$ in the broad resonance regime can be roughly estimated using the condition
$\varepsilon_\chi \sim \varepsilon_\varphi$:

$$N_r \simeq (0.75\text{–}2)\log_3 m^{-1}. \tag{5.104}$$


As an example, if $m \simeq 10^{13}$ GeV, we have $N_r \simeq 10$–$25$ for a wide range of the
coupling constants $10^{-3} > \tilde{g} > 10^{-6}$. Taking into account that the total energy
decays as $m^2(\Phi_0/N)^2$, we obtain

$$\frac{\varepsilon_\varphi}{\varepsilon_{\rm tot}} \sim \frac{m^2\Phi_r^2}{m^2(\Phi_0/N_r)^2} \sim N_r^2\left(\frac{m}{\tilde{g}\Phi_0}\right)^2, \tag{5.105}$$








that is, the energy still stored in the inflaton field at the end of broad resonance is
only a small fraction of the total energy. In particular, for $m \simeq 10^{13}$ GeV, this ratio
varies in the range $10^{-6}$–$O(1)$ depending on the coupling constant $\tilde{g}$.
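
The estimate (5.104) can be reproduced by solving the condition $\varepsilon_\chi \sim \varepsilon_\varphi$ numerically. The sketch below assumes the scaling $\varepsilon_\chi/\varepsilon_\varphi \sim m^{-1/2}\tilde{g}^{5/2}N^{-1}3^N$ of (5.103) with $\Phi_0 = 1$, works in Planck units, and takes $m \simeq 10^{13}$ GeV to correspond to $m \approx 10^{-6}$; this conversion and the function names are assumptions of the example.

```python
import math

def log_energy_ratio(n, m, g):
    """Natural log of m^(-1/2) g^(5/2) 3^N / N, the energy-density ratio in Planck units."""
    return -0.5 * math.log(m) + 2.5 * math.log(g) + n * math.log(3.0) - math.log(n)

def oscillations_until_equal(m, g, lo=1.0, hi=200.0):
    """Bisect for the number of oscillations N at which the ratio reaches unity."""
    if log_energy_ratio(lo, m, g) > 0.0 or log_energy_ratio(hi, m, g) < 0.0:
        raise ValueError("root is not bracketed")
    for _ in range(80):
        mid = 0.5 * (lo + hi)
        if log_energy_ratio(mid, m, g) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

if __name__ == "__main__":
    m = 1e-6                       # assumed value of 10^13 GeV in Planck units
    for g in (1e-3, 1e-4, 1e-5, 1e-6):
        print(g, oscillations_until_equal(m, g))
```

Keeping the slowly varying factor $N^{-1}$ shifts the answer by a few oscillations relative to the leading-logarithm estimate, which is within the accuracy of such a rough argument.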


Problem 5.14 Investigate inflaton decay due to the three-leg interaction in the
strong coupling regime: $m > g > m^2/\Phi$.


5.5.4 Implications 


It follows from the above considerations that broad parametric resonance can play 
a very important role in the preheating phase. During only 15—25 oscillations of the 
inflaton, it can convert most of the inflaton energy into other scalar particles. The 
most interesting aspect of this process is that the effective mass and the momenta 
of the particles produced can exceed the inflaton mass. For example, for $m \sim 10^{14}$
GeV, the effective mass $m_\chi^{\rm eff} = \tilde{g}\Phi|\cos(mt)|$ can be as large as $10^{16}$ GeV. Therefore,
if the x particles are coupled to bosonic and fermionic fields heavier than the 
inflaton, then the inflaton may indirectly decay into these heavy particles. This 
brings Grand Unification scales back into play. For instance, even if the inflation 
ends at low energy scales, preheating may rescue the GUT baryogenesis models. 
Another potential outcome of the above mechanism is the far-from-equilibrium 
production of topological defects after inflation. Obviously their numbers must not 
conflict with observations and this leads to cosmological bounds on admissible 
theories. 

If, after the period of broad resonance, the slightest amount of the inflaton re- 
mained — given by (5.105) — it would be a cosmological disaster. Since the inflaton 
particles are nonrelativistic, if they were present in any substantial amount, they 
would soon dominate and leave us with a cold universe. Fortunately, these particles 
should easily decay in the subsequent narrow resonance regime or as a result of el- 
ementary particle decay. These decay channels thus become necessary ingredients 
of the reheating theory. 

The considerations of this section do not constitute a complete theory of reheating.
We have studied only elementary processes which could play a role in producing
a hot Friedmann universe. The final outcome of reheating must be matter in thermal 
equilibrium. The particles which are produced in the preheating processes are ini- 
tially in a highly nonequilibrium state. Numerical calculations show that as a result 
of their scatterings they quickly reach local thermal equilibrium. Parameterizing
the total preheating and reheating time in terms of the number of inflaton oscillations
$N_T$, we obtain the following estimate for the reheating temperature:

$$T_R \sim \frac{m^{1/2}}{N_T^{1/2}N^{1/4}}, \tag{5.106}$$

where $N$ is the effective number of degrees of freedom of the light fields at $T \sim T_R$.
Assuming that $N_T \sim 10^6$, and taking $N \sim 10^2$ and $m \simeq 10^{13}$ GeV, we obtain
$T_R \sim 10^{12}$ GeV. This does not mean, however, that we can ignore physics beyond this
scale. As we have already pointed out, nonequilibrium preheating processes can 
play a nontrivial role. 
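
The arithmetic behind this estimate can be made explicit. In Planck units the reheating time is $t \sim N_T/m$, so $H \sim m/N_T$, and equating $H^2$ with the energy density $\sim N T^4$ of $N$ light degrees of freedom gives $T_R \sim (m/N_T)^{1/2}N^{-1/4}$. The sketch below evaluates this expression; the Planck mass value used for the unit conversion and the function name are assumptions of the example, and all numerical factors of order unity are dropped.

```python
PLANCK_MASS_GEV = 1.22e19  # assumed conversion between Planck units and GeV

def reheating_temperature_gev(m_gev, n_oscillations, n_dof):
    """Order-of-magnitude T_R ~ (m / N_T)^(1/2) / N^(1/4), returned in GeV."""
    m = m_gev / PLANCK_MASS_GEV                      # inflaton mass in Planck units
    t_planck = (m / n_oscillations) ** 0.5 / n_dof ** 0.25
    return t_planck * PLANCK_MASS_GEV

if __name__ == "__main__":
    print(reheating_temperature_gev(1e13, 1e6, 1e2))
```

With $m \simeq 10^{13}$ GeV, $10^6$ oscillations and $10^2$ degrees of freedom this gives a few times $10^{12}$ GeV, consistent with the order of magnitude quoted above.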

Reheating is an important ingredient of inflationary cosmology. We have seen 
that there is no general obstacle to arranging successful reheating. A particle theory 
should be tested on its ability to realize reheating in combination with baryogenesis. 
In this way, cosmology enables us to preselect realistic particle physics theories 
beyond the Standard Model. 


5.6 “Menu” of scenarios 


All we need for successful inflation is a scalar condensate satisfying the slow-roll 
conditions. Building concrete scenarios then becomes a “technical” problem. In- 
volving two or more scalar condensates, and assuming them to be equally relevant 
during inflation, extends the number of possibilities, but simultaneously diminishes 
the predictive power of inflation. This especially concerns cosmological perturba- 
tions, which are among the most important robust predictions of inflation. Because 
inflation can be falsified experimentally (or more accurately, observationally) only 
if it makes such predictions, we consider only simple scenarios with a single in- 
flaton component. Fortunately all of them lead to very similar predictions which 
differ only slightly in the details. This makes a unique scenario, the one actually 
realized in nature, less important. The situation here is very different from particle 
physics, where the concrete models are as important as the ideas behind them. This 
does not mean we do not need the correct scenario; if one day it becomes avail- 
able, we will be able to verify more delicate predictions of inflation. However, even 
in the absence of the true scenario, we can nonetheless verify observationally the 
most important predictions of the stage of cosmic acceleration. The purpose of this 
section is to give the reader a very brief guide to the “menu of scenarios” discussed 
in the literature. 


Inflaton candidates The first question which naturally arises is “what is the most
realistic candidate for the inflaton field?”. There are many because the only requirement
is that this candidate imitates a scalar condensate in the slow-roll regime. This
can be achieved by a fundamental scalar field or by a fermionic condensate described
in terms of an effective scalar field. This, however, does not exhaust all possibilities.
The scalar condensate can also be imitated entirely within the theory of gravity itself.
Einstein gravity is only a low curvature limit of some more complicated theory
whose action contains higher powers of the curvature invariants, for example,

$$S = -\frac{1}{16\pi}\int\left(R + \alpha R^2 + \beta R_{\mu\nu}R^{\mu\nu} + \gamma R^3 + \cdots\right)\sqrt{-g}\,d^4x. \tag{5.107}$$


The quadratic and higher-order terms can be either of fundamental origin or they can
arise as a result of vacuum polarization. The corresponding dimensional coefficients
in front of these terms are likely of Planckian size. The theory with (5.107) can 
provide us with inflation. This can easily be understood. Einstein gravity is the only 
metric theory in four dimensions where the equations of motion are second order. 
Any modification of the Einstein action introduces higher-derivative terms. This 
means that, in addition to the gravitational waves, the gravitational field has extra 
degrees of freedom including, generically, a spin 0 field. 


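Problem 5.15 Consider a theory with metric $g_{\mu\nu}$ and action

$$S = -\frac{1}{16\pi}\int f(R)\sqrt{-g}\,d^4x, \tag{5.108}$$

where $f(R)$ is an arbitrary function of the scalar curvature $R$. Derive the following
equations of motion:

$$\frac{\partial f}{\partial R}R^\mu_\nu - \frac{1}{2}\delta^\mu_\nu f + \left(\frac{\partial f}{\partial R}\right)^{;\alpha}_{;\alpha}\delta^\mu_\nu - \left(\frac{\partial f}{\partial R}\right)^{;\mu}_{;\nu} = 0. \tag{5.109}$$

Verify that under the conformal transformation $g_{\mu\nu} \to \tilde{g}_{\mu\nu} = F g_{\mu\nu}$, the Ricci tensor
and the scalar curvature transform as

$$R^\mu_\nu \to \tilde{R}^\mu_\nu = F^{-1}R^\mu_\nu - F^{-2}F^{;\mu}_{;\nu} - \frac{1}{2}F^{-2}F^{;\alpha}_{;\alpha}\delta^\mu_\nu + \frac{3}{2}F^{-3}F_{;\nu}F^{;\mu}, \tag{5.110}$$

$$R \to \tilde{R} = F^{-1}R - 3F^{-2}F^{;\alpha}_{;\alpha} + \frac{3}{2}F^{-3}F_{;\alpha}F^{;\alpha}. \tag{5.111}$$

Introduce the “scalar field”


and show that the equations

coincide with (5.109) if we set $F = \partial f/\partial R$ and take the following potential for the
scalar field:

$$V(\varphi) = \frac{1}{16\pi}\,\frac{f - R\,\partial f/\partial R}{(\partial f/\partial R)^2}. \tag{5.114}$$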


Problem 5.16 Study the inflationary solutions in $R^2$ gravity:

$$S = -\frac{1}{16\pi}\int\left(R - \frac{1}{6M^2}R^2\right)\sqrt{-g}\,d^4x. \tag{5.115}$$

What is the physical meaning of the constant $M$?


Thus, the higher derivative gravity theory is conformally equivalent to Einstein 
gravity with an extra scalar field. If the scalar field potential satisfies the slow-roll 
conditions, then we have an inflationary solution in the conformal frame for the 
metric $\tilde{g}_{\mu\nu}$. However, one should not confuse the conformal metric with the original
physical metric. They generally describe manifolds with different geometries and 
the final results must be interpreted in terms of the original metric. In our case the 
use of the conformal transformation is a mathematical tool which simply allows 
us to reduce the problem to one we have studied before. The conformal metric is 
related to the physical metric $g_{\mu\nu}$ by a factor $F$, which depends on the curvature
invariants; it does not change significantly during inflation. Therefore, we also have 
an inflationary solution in the original physical frame. 

So far we have been considering inflationary solutions due to the potential of 
the scalar field. However, inflation can be realized even without a potential term. 
It can occur in Born–Infeld-type theories, where the action depends nonlinearly on
the kinetic energy of the scalar field. These theories do not have higher-derivative 
terms, but they have some other peculiar properties. 


Problem 5.17 Consider a scalar field with action

$$S = \int p(X, \varphi)\sqrt{-g}\,d^4x, \tag{5.116}$$

where $p$ is an arbitrary function of $\varphi$ and $X \equiv \frac{1}{2}(\partial_\mu\varphi\,\partial^\mu\varphi)$. Verify that the
energy–momentum tensor for this field can be written in the form

$$T^\mu_\nu = (\varepsilon + p)u^\mu u_\nu - p\,\delta^\mu_\nu, \tag{5.117}$$

where the Lagrangian $p$ plays the role of the effective pressure and

$$\varepsilon = 2X\frac{\partial p}{\partial X} - p, \qquad u_\nu = \frac{\partial_\nu\varphi}{\sqrt{2X}}. \tag{5.118}$$

If the Lagrangian $p$ satisfies the condition $X\partial p/\partial X \ll p$ for some range of $X$ and
$\varphi$, then the equation of state is $p \approx -\varepsilon$ and we have an inflationary solution. Why
is inflation not satisfactory if $p$ depends only on $X$? Consider a general $p(X, \varphi)$
without an explicit potential term, that is, $p \to 0$ when $X \to 0$. Formulate the
conditions which this function must satisfy to provide us with a slow-roll inflationary
stage and a graceful exit. The inflationary scenario based on the nontrivial
dependence of the Lagrangian on the kinetic term is called k inflation.
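
The relation between the Lagrangian $p(X, \varphi)$ and the energy density in (5.118) can be explored numerically without attempting the problem itself. The sketch below is only an illustration: it approximates $\partial p/\partial X$ by a central finite difference and applies it to the canonical Lagrangian $p = X - V(\varphi)$ with an assumed quadratic potential $V = \frac{1}{2}m^2\varphi^2$. The function names and parameter values are hypothetical choices, not taken from the text.

```python
def energy_density(p, X, phi, rel_step=1e-3):
    """epsilon = 2 X dp/dX - p, with dp/dX from a central difference (requires X > 0)."""
    h = rel_step * X
    dp_dX = (p(X + h, phi) - p(X - h, phi)) / (2.0 * h)
    return 2.0 * X * dp_dX - p(X, phi)

def canonical_p(X, phi, mass=1.0):
    """Canonical scalar field: p = X - V(phi) with the assumed V = mass^2 phi^2 / 2."""
    return X - 0.5 * mass**2 * phi**2

def equation_of_state(p, X, phi):
    """Ratio w = p / epsilon for the given Lagrangian."""
    return p(X, phi) / energy_density(p, X, phi)

if __name__ == "__main__":
    phi = 1.0
    for X in (5e-5, 5e-3, 0.5):
        print(X, energy_density(canonical_p, X, phi), equation_of_state(canonical_p, X, phi))
```

For the canonical case the function returns $\varepsilon = X + V$, and $p/\varepsilon$ approaches $-1$ when the kinetic term is much smaller than the potential, which is the familiar slow-roll limit.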


Scenarios The simplest inflationary scenarios can be subdivided into three classes. 
They correspond to the usual scalar field with a potential, higher-derivative gravity 
and k inflation. The cosmological consequences of scenarios from the different 
classes are almost indistinguishable — they can exactly imitate each other. Within 
each class, however, we can try to make further distinctions by addressing the 
questions: (a) what was before inflation and (b) how does a graceful exit to a 
Friedmann stage occur? For our purpose it will be sufficient to consider only the 
simplest case of a scalar field with canonical kinetic energy. The potential can have 
different shapes, as shown in Figure 5.7. The three cases presented correspond to 
the so-called old, new and chaotic inflationary scenarios. The first two names refer 
to their historical origins. 

Old inflation (see Figure 5.7(a)) assumes that the scalar field arrives at the local 
minimum of the potential at $\varphi = 0$ as a result of a supercooling of the initially
hot universe. After that the universe undergoes a stage of accelerated expansion 
with a subsequent graceful exit via bubble nucleation. It was clear from the very 
beginning that this scenario could not provide a successful graceful exit because 
all the energy released in a bubble is concentrated in its wall and the bubbles have 
no chance to collide. This difficulty was avoided in the new inflationary scenario, 
a scenario similar to a successful model in higher-derivative gravity which had 
previously been invented. 

Fig. 5.7.

New inflation is based on a Coleman–Weinberg type potential (Figure 5.7(b)).
Because the potential is very flat and has a maximum at $\varphi = 0$, the scalar field
escapes from the maximum not via tunneling, but due to the quantum fluctuations.
It then slowly rolls towards the global minimum where the energy is released 
homogeneously in the whole space. Originally the pre-inflationary state of the 
universe was taken to be thermal so that the symmetry was restored due to thermal 
corrections. This was a justification for the initial conditions of the scalar field. 
Later it was realized that the thermal initial state of the universe is quite unlikely, 
and so now the original motivation for the initial conditions in the new inflationary 
model seems to be false. Instead, the universe might be in a “self-reproducing” 
regime (for more details see Section 8.5). 

Chaotic inflation is the name given to the broadest possible class of potentials 
satisfying the slow-roll conditions (Figure 5.7(c)). We have considered it in detail 
in the previous sections. The name chaotic is related to the possibility of having 
almost arbitrary initial conditions for the scalar field. To be precise, this field must 
initially be larger than the Planckian value but it is otherwise arbitrary. Indeed, it 
could have varied from one spatial region to another and, as a result, the universe 
would have a very complicated global structure. It could be very inhomogeneous 
on scales much larger than the present horizon and extremely homogeneous on 
“small” scales corresponding to the observable domain. We will see in Section 8.5 
that in the case of chaotic inflation, quantum fluctuations lead to a self-reproducing 
universe. 

Since chaotic inflation encompasses so many potentials, one might think it worth- 
while to consider special cases, for example, an exponential potential. For an ex- 
ponential potential, if the slow-roll conditions are satisfied once, they are always 
satisfied. Therefore, it describes (power-law) inflation without a graceful exit. To 
arrange a graceful exit we have to “damage” the potential. For two or more scalar 
fields the number of options increases. Thus it is not helpful here to go into the 
details of the different models. 

In the absence of the underlying fundamental particle theory, one is free to 
play with the potentials and invent more new scenarios. In this sense the situation 
has changed since the time the importance of inflation was first realized. In fact, 
in the 1980s many people considered inflation a useful application of the Grand 
Unified Theory that was believed to be known. Besides solving the initial condi- 
tions problem, inflation also explained why we do not have an overabundance of the 
monopoles that are an inevitable consequence of a Grand Unified Theory. Either 
inflation ejects all previously created monopoles, leaving less than one monopole 
per present horizon volume, or the monopoles are never produced. The same ar- 
gument applies to the heavy stable particles that could be overproduced in the 
state of thermal equilibrium at high temperatures. Many authors consider the solution
of the monopole and heavy particle problems to be as important as a solution
of the initial conditions problem. We would like to point out, however, that the
initial conditions problem is posed to us by nature, while the other problems are, at 
present, not more than internal problems of theories beyond the Standard Model. 
By solving these extra problems, inflation opens the door to theories that would 
otherwise be prohibited by cosmology. Depending on one’s attitude, this is either 
a useful or damning achievement of inflation. 


De Sitter solution and inflation The last point we would like to make concerns the 
role of a cosmological constant and a pure de Sitter solution for inflation. We have 
already said that the pure de Sitter solution cannot provide us with a model with 
a graceful exit. Even the notion of expansion is not unambiguously defined in de 
Sitter space. We saw in Section 1.3.6 that this space has the same symmetry group 
as Minkowski space. It is spatially homogeneous and time-translation-invariant. 
Therefore any space-like surface is a hypersurface of constant energy. To characterize
an expansion we can use not only $k = 0, \pm 1$ Friedmann coordinates but also,
for example, “static coordinates” (see Problem 2.7), which describe an expanding 
space outside the event horizon. In all these cases the 3-geometries of constant time 
hypersurfaces are very different. These differences, however, simply characterize 
the different slicings of the perfectly symmetrical space and there is no obvious 
preferable choice for the coordinates. 

It is important therefore that inflation is never realized by a pure de Sitter so- 
lution. There must be deviations from the vacuum equation of state, which finally 
determine the “hypersurface” of transition to the hot universe. The de Sitter universe 
is still, however, a very useful zeroth order approximation for nearly all inflationary 
models. In fact, the effective equation of state must satisfy the condition $\varepsilon + 3p < 0$
for at least 75 e-folds. This is generally possible only if during most of the time
we have $p \approx -\varepsilon$ to a rather high accuracy. Therefore, one can use the language
of constant time hypersurfaces defined in various coordinate systems in de Sitter
space. Our earlier considerations show that the transition from inflation to the
Friedmann universe occurs along a hypersurface of constant time in the expanding
isotropic coordinates ($\eta = \text{const}$), but not along an $r = \text{const}$ hypersurface in the
“static coordinates.” The next question is: out of the three possible isotropic coordinate
systems ($k = 0, \pm 1$), which must be used to match the de Sitter space to
the Friedmann universe? Depending on the answer to this question, we obtain flat, 
open or closed Friedmann universes. It turns out, however, that this answer seems 
not to be relevant for the observable domain of the universe. In fact, if inflation 
lasts more than 75 e-folds, the observable part of the universe corresponds only to 
a tiny piece of the matched global conformal diagrams for de Sitter and Friedmann 
universes. This piece is located near the upper border of the conformal diagram for
de Sitter space and the lower border for the flat, open or closed Friedmann universes
(Figure 5.8), where the difference between the hypersurfaces of constant time for 
flat, open or closed cases is negligibly small. After a graceful exit we obtain a very 
large domain of the Friedmann universe with incredibly small flatness and this 
domain covers all present observable scales. The global structure of the universe 
on scales much larger than the present horizon is not relevant for an observer — at 
least not for the next 100 billion years. In Part II we will see that the issue of the 
global structure is complicated by quantum fluctuations. These fluctuations are am- 
plified during inflation and as a result the hypersurface of transition has “wrinkles.” 
The wrinkles are rather small on scales corresponding to the observable universe 
but they become huge on the very large scales. Hence, globally the universe is 
very different from the Friedmann space and the question about the spatial curva- 
ture of the whole universe no longer makes sense. It also follows that the global 
properties of an exact de Sitter solution have no relevance for the real physical 
universe. 


Part II 


Inhomogeneous universe 


6 


Gravitational instability in Newtonian theory 


Measurements of the cosmic microwave background tell us that the universe was 
very homogeneous and isotropic at the time of recombination. Today, however, the 
universe has a well developed nonlinear structure. This structure takes the form of 
galaxies, clusters and superclusters of galaxies, and, on larger scales, of voids, sheets 
and filaments of galaxies. Deep redshift surveys show, however, that when averaged 
over a few hundred megaparsecs, the inhomogeneities in the density distribution 
remain small. The simple explanation as to how nonlinear structure could develop 
from small initial perturbations is based on the fact of gravitational instability. 

Gravitational instability is a natural property of gravity. Matter is attracted to 
high-density regions, thus amplifying already existing inhomogeneities. To ensure 
that the small initial inhomogeneities present at recombination produce the nonlin- 
ear structure observed today, we have to study how fast they grow in an expanding 
universe. The complete general relativistic analysis of gravitational instability is 
rather involved and the physical interpretation of the results is not always straight- 
forward. For this reason we develop the theory of gravitational instability in several 
steps. 

In this chapter we consider gravitational instability in the Newtonian theory of 
gravity. The results derived in this theory are applicable only to nonrelativistic 
matter on scales not exceeding the Hubble horizon. First, we find out how small 
inhomogeneities grow in a nonexpanding universe (Jeans theory). The main pur- 
pose here is to determine which types of perturbations can exist in homogeneous, 
isotropic media, and to introduce methods to analyze them. Although the formulae 
describing the rate of instability in a nonexpanding universe are not very useful, 
the results obtained help us gain a solid intuitive understanding of the behavior 
of perturbations. Next, we consider linear perturbations in an expanding universe. 
This is not only a useful exercise, but a realistic theory describing the growth of 
inhomogeneities on subhorizon scales after recombination. We apply this theory 
to study the rate of instability in a matter-dominated universe and then see how a
smooth unclustered energy component, like radiation or vacuum energy, influences
the growth of inhomogeneities in the cold matter component. Finally, we derive a 
few exact solutions which describe the behavior of perturbations with certain spatial 
geometrical symmetries into the nonlinear regime. Based on these solutions, we are 
able to explain the general features of the matter distribution on nonlinear scales. 


6.1 Basic equations 


On large scales matter can be described in a perfect fluid approximation. This means 
that at any given moment of time it can be completely characterized by the energy 
density distribution $\varepsilon(\mathbf{x}, t)$, the entropy per unit mass $S(\mathbf{x}, t)$, and the vector field
of 3-velocities $\mathbf{V}(\mathbf{x}, t)$. These quantities satisfy the hydrodynamical equations and
we begin with a brief reminder of their derivation. 


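Continuity equation If we consider a fixed volume element $\Delta V$ in Euler (noncomoving)
coordinates $\mathbf{x}$, then the rate of change of its mass can be written as

$$\frac{dM(t)}{dt} = \int_{\Delta V}\frac{\partial\varepsilon(\mathbf{x}, t)}{\partial t}\,dV. \tag{6.1}$$

On the other hand, this rate is entirely determined by the flux of matter through the
surface surrounding the volume:

$$\frac{dM(t)}{dt} = -\oint\varepsilon\mathbf{V}\cdot d\boldsymbol{\sigma} = -\int_{\Delta V}\nabla(\varepsilon\mathbf{V})\,dV. \tag{6.2}$$

These two expressions are consistent only if

$$\frac{\partial\varepsilon}{\partial t} + \nabla(\varepsilon\mathbf{V}) = 0. \tag{6.3}$$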

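Euler equations The acceleration $\mathbf{g}$ of a small matter element of mass $\Delta M$ is
determined by the gravitational force

$$\mathbf{F}_{gr} = -\Delta M \cdot \nabla\phi, \tag{6.4}$$

where $\phi$ is the gravitational potential, and by the pressure $p$:

$$\mathbf{F}_{pr} = -\oint p \cdot d\boldsymbol{\sigma} = -\int_{\Delta V} \nabla p\, dV \simeq -\nabla p \cdot \Delta V. \tag{6.5}$$

With

$$\mathbf{g} \equiv \frac{d\mathbf{V}(\mathbf{x}(t), t)}{dt} = \left(\frac{\partial \mathbf{V}}{\partial t}\right)_{\mathbf{x}} + \frac{dx^i(t)}{dt}\left(\frac{\partial \mathbf{V}}{\partial x^i}\right) = \frac{\partial \mathbf{V}}{\partial t} + (\mathbf{V} \cdot \nabla)\mathbf{V}, \tag{6.6}$$

Newton's force law

$$\Delta M \cdot \mathbf{g} = \mathbf{F}_{gr} + \mathbf{F}_{pr}, \tag{6.7}$$

becomes the Euler equations

$$\frac{\partial \mathbf{V}}{\partial t} + (\mathbf{V} \cdot \nabla)\mathbf{V} + \frac{\nabla p}{\varepsilon} + \nabla\phi = 0. \tag{6.8}$$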


Conservation of entropy Neglecting dissipation, the entropy of a matter element is
conserved:

$$\frac{dS(\mathbf{x}(t), t)}{dt} = \frac{\partial S}{\partial t} + (\mathbf{V} \cdot \nabla) S = 0. \tag{6.9}$$


Poisson equation Finally, the equation which determines the gravitational potential
is the well-known Poisson equation,

$$\Delta\phi = 4\pi G \varepsilon. \tag{6.10}$$

Equations (6.3), (6.8)-(6.10), taken together with the equation of state

$$p = p(\varepsilon, S), \tag{6.11}$$

form a complete set of seven equations which, in principle, allows us to determine
the seven unknown functions $\varepsilon$, $\mathbf{V}$, $S$, $\phi$, $p$. Note that only the first five equations
contain first time derivatives. Hence the most general solution of these equations 
should depend on five constants of integration which in our case are five arbitrary 
functions of the spatial coordinates x. The hydrodynamical equations are nonlinear 
and in general it is not easy to find their solutions. However, to study the behavior of 
small perturbations around a homogeneous, isotropic background, it is appropriate 
to linearize them. 


6.2 Jeans theory 


Let us first consider a static nonexpanding universe, assuming the homogeneous, 
isotropic background with constant, time-independent matter density: $\varepsilon_0(t, \mathbf{x}) =$
const. This assumption is in obvious contradiction with the hydrodynamical equations.
In fact, the energy density remains unchanged only if the matter is at rest
and the gravitational force, $\mathbf{F} \propto \nabla\phi$, vanishes. But then the Poisson equation
$\Delta\phi = 4\pi G \varepsilon_0$ is not satisfied. This inconsistency can, in principle, be avoided
if we consider a static Einstein universe, where the gravitational force of the matter 
is compensated by the “antigravitational” force of an appropriately chosen cosmo- 
logical constant. 


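Slightly disturbing the matter distribution, we have:

$$\begin{aligned}
\varepsilon(\mathbf{x}, t) &= \varepsilon_0 + \delta\varepsilon(\mathbf{x}, t), & \mathbf{V}(\mathbf{x}, t) &= \mathbf{V}_0 + \delta\mathbf{v} = \delta\mathbf{v}(\mathbf{x}, t), \\
\phi(\mathbf{x}, t) &= \phi_0 + \delta\phi(\mathbf{x}, t), & S(\mathbf{x}, t) &= S_0 + \delta S(\mathbf{x}, t),
\end{aligned} \tag{6.12}$$

where $\delta\varepsilon \ll \varepsilon_0$, etc. The pressure is equal to

$$p(\mathbf{x}, t) = p(\varepsilon_0 + \delta\varepsilon, S_0 + \delta S) = p_0 + \delta p(\mathbf{x}, t), \tag{6.13}$$

and in linear approximation its perturbation $\delta p$ can be expressed in terms of the
energy density and entropy perturbations as

$$\delta p = c_s^2\, \delta\varepsilon + \sigma\, \delta S. \tag{6.14}$$

Here $c_s^2 \equiv (\partial p/\partial\varepsilon)_S$ is the square of the speed of sound and $\sigma \equiv (\partial p/\partial S)_\varepsilon$. For
nonrelativistic matter ($p \ll \varepsilon$), the speed of sound as well as the velocities $\delta\mathbf{v}$ are
much less than the speed of light.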

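Substituting (6.12) and (6.14) into (6.3), (6.8)-(6.10) and keeping only the terms
which are linear in the perturbations, we obtain:

$$\frac{\partial \delta\varepsilon}{\partial t} + \varepsilon_0 \nabla(\delta\mathbf{v}) = 0, \tag{6.15}$$

$$\frac{\partial \delta\mathbf{v}}{\partial t} + \frac{c_s^2}{\varepsilon_0}\nabla\delta\varepsilon + \frac{\sigma}{\varepsilon_0}\nabla\delta S + \nabla\delta\phi = 0, \tag{6.16}$$

$$\frac{\partial \delta S}{\partial t} = 0, \tag{6.17}$$

$$\Delta\delta\phi = 4\pi G\, \delta\varepsilon. \tag{6.18}$$

Equation (6.17) has a simple general solution

$$\delta S(\mathbf{x}, t) = \delta S(\mathbf{x}), \tag{6.19}$$

which states that the entropy is an arbitrary time-independent function of the spatial
coordinates.

Taking the divergence of (6.16) and using the continuity and Poisson equations
to express $\nabla\delta\mathbf{v}$ and $\Delta\delta\phi$ in terms of $\delta\varepsilon$, we obtain

$$\frac{\partial^2 \delta\varepsilon}{\partial t^2} - c_s^2 \Delta\delta\varepsilon - 4\pi G \varepsilon_0\, \delta\varepsilon = \sigma \Delta\delta S(\mathbf{x}). \tag{6.20}$$

This is a closed, linear equation for $\delta\varepsilon$, where the entropy perturbation serves as a
given source.


6.2.1 Adiabatic perturbations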


First we will assume that entropy perturbations are absent, that is, $\delta S = 0$. The
coefficients in (6.20) do not depend on the spatial coordinates, so upon taking the
Fourier transform,

$$\delta\varepsilon(\mathbf{x}, t) = \int \delta\varepsilon_\mathbf{k}(t) \exp(i\mathbf{k}\mathbf{x})\, \frac{d^3k}{(2\pi)^{3/2}}, \tag{6.21}$$

we obtain a set of independent ordinary differential equations for the time-dependent
Fourier coefficients $\delta\varepsilon_\mathbf{k}(t)$:

$$\delta\ddot\varepsilon_\mathbf{k} + (k^2 c_s^2 - 4\pi G \varepsilon_0)\, \delta\varepsilon_\mathbf{k} = 0, \tag{6.22}$$

where a dot denotes the derivative with respect to time $t$ and $k = |\mathbf{k}|$.
Equation (6.22) has two independent solutions

$$\delta\varepsilon_\mathbf{k} \propto \exp(\pm i\omega(k)\, t), \tag{6.23}$$

where

$$\omega(k) \equiv \sqrt{k^2 c_s^2 - 4\pi G \varepsilon_0}.$$

The behavior of these so-called adiabatic perturbations depends crucially on the
sign of the expression under the square root. Defining the Jeans length as

$$\lambda_J = \frac{2\pi}{k_J} = c_s \left(\frac{\pi}{G\varepsilon_0}\right)^{1/2}, \tag{6.24}$$

so that $\omega(k_J) = 0$, we conclude that if $\lambda < \lambda_J$, the solutions describe sound waves

$$\delta\varepsilon \propto \sin(\omega t + \mathbf{k}\mathbf{x} + \alpha), \tag{6.25}$$

propagating with phase velocity

$$c_{phase} = \frac{\omega}{k} = c_s \sqrt{1 - \frac{k_J^2}{k^2}}. \tag{6.26}$$

In the limit $k \gg k_J$, or on very small scales ($\lambda \ll \lambda_J$) where gravity is negligible
compared to the pressure, we have $c_{phase} \to c_s$, as it should be.
On large scales gravity dominates and if $\lambda > \lambda_J$, we have

$$\delta\varepsilon_\mathbf{k} \propto \exp(\pm |\omega|\, t). \tag{6.27}$$

One of these solutions describes the exponentially fast growth of inhomogeneities,
while the other corresponds to a decaying mode. When $k \to 0$, $|\omega| t \to t/t_{gr}$, where
$t_{gr} \equiv (4\pi G \varepsilon_0)^{-1/2}$. We interpret $t_{gr}$ as the characteristic collapse time for a region
with initial density $\varepsilon_0$.
The Jeans length $\lambda_J \sim c_s t_{gr}$ is the "sound communication" scale over which
the pressure can still react to changes in the energy density due to gravitational
instability. Gravitational instability is very efficient in a static universe. Even if the
adiabatic perturbation is initially extremely small, say $10^{-100}$, gravity needs only a
short time $t \sim 230\, t_{gr}$ to amplify it to order unity.
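
As a quick numerical illustration of (6.23)-(6.27), the following minimal sketch evaluates the Jeans length, the sign of $\omega^2$ on either side of the Jeans scale, and the number of e-folds needed to amplify a perturbation of amplitude $10^{-100}$ to order unity. The function names and the sample values of $c_s$ and $\varepsilon_0$ are illustrative assumptions rather than anything taken from the text; CGS units are used and $\varepsilon_0$ is treated as a mass density.

```python
import math

G = 6.674e-8  # Newton's constant in cm^3 g^-1 s^-2 (CGS)


def jeans_length(c_s, eps0):
    """Jeans length (6.24): lambda_J = c_s * sqrt(pi / (G * eps0))."""
    return c_s * math.sqrt(math.pi / (G * eps0))


def omega_squared(k, c_s, eps0):
    """Square of the frequency in (6.23): k^2 c_s^2 - 4 pi G eps0."""
    return (k * c_s) ** 2 - 4.0 * math.pi * G * eps0


def collapse_time(eps0):
    """Characteristic collapse time t_gr = (4 pi G eps0)^(-1/2)."""
    return 1.0 / math.sqrt(4.0 * math.pi * G * eps0)


# Hypothetical sample values: sound speed 1 km/s, density 1e-24 g/cm^3.
c_s, eps0 = 1.0e5, 1.0e-24
lam_J = jeans_length(c_s, eps0)
k_J = 2.0 * math.pi / lam_J

# Time, in units of t_gr, for the k -> 0 growing mode exp(t / t_gr)
# to amplify an initial amplitude of 1e-100 to unity.
efolds = math.log(1e100)

print(f"lambda_J = {lam_J:.3e} cm, t_gr = {collapse_time(eps0):.3e} s")
print(f"omega^2 at 2 k_J:   {omega_squared(2.0 * k_J, c_s, eps0):+.3e}  (sound wave)")
print(f"omega^2 at k_J / 2: {omega_squared(0.5 * k_J, c_s, eps0):+.3e}  (instability)")
print(f"e-folds needed: {efolds:.1f}")
```

The last number is $100 \ln 10$, which is where the estimate $t \sim 230\, t_{gr}$ comes from.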


Problem 6.1 Find and analyze the expressions for $\delta\mathbf{v}_\mathbf{k}$ and $\delta\phi_\mathbf{k}$ for sound waves and
for the perturbations on scales larger than the Jeans wavelength. 


6.2.2 Vector perturbations 


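The trivial solution of (6.20) with $\delta\varepsilon = 0$ and $\delta S = 0$ can correspond to a nontrivial
solution of the complete system of the hydrodynamical equations. In this case, in
fact, (6.15)-(6.18) reduce to

$$\nabla\delta\mathbf{v} = 0, \qquad \frac{\partial \delta\mathbf{v}}{\partial t} = 0. \tag{6.28}$$

From the second equation it follows that $\delta\mathbf{v}$ can be an arbitrary time-independent
function of the spatial coordinates, $\delta\mathbf{v} = \delta\mathbf{v}(\mathbf{x})$. The first equation tells us that for
the plane wave perturbation, $\delta\mathbf{v} = \mathbf{w}_\mathbf{k} \exp(i\mathbf{k}\mathbf{x})$, the velocity is perpendicular to the
wave vector $\mathbf{k}$:

$$\mathbf{w}_\mathbf{k} \cdot \mathbf{k} = 0. \tag{6.29}$$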


These vector perturbations describe shear motions of the media which do not dis- 
turb the energy density. Because there are two independent directions perpendicular 
to k, there exist two independent vector modes for a given k. 


6.2.3 Entropy perturbations 


In the presence of entropy inhomogeneities ($\delta S \neq 0$), the Fourier transform of
(6.20) is

$$\delta\ddot\varepsilon_\mathbf{k} + (k^2 c_s^2 - 4\pi G \varepsilon_0)\, \delta\varepsilon_\mathbf{k} = -\sigma k^2\, \delta S_\mathbf{k}. \tag{6.30}$$

The general solution of this equation can be written as the sum of its particular
solution and a general solution of the homogeneous equation with $\delta S_\mathbf{k} = 0$. The
particular time-independent solution of (6.30),

$$\delta\varepsilon_\mathbf{k} = -\frac{\sigma k^2\, \delta S_\mathbf{k}}{k^2 c_s^2 - 4\pi G \varepsilon_0}, \tag{6.31}$$

is called the entropy perturbation. Note that in the short distance limit $k \to \infty$,
when gravity is unimportant, $\delta\varepsilon_\mathbf{k} \to -\sigma\, \delta S_\mathbf{k}/c_s^2$. In this case the contribution to
the pressure due to the energy density inhomogeneities is exactly compensated
by the corresponding contribution from the entropy perturbations, so that $\delta p_\mathbf{k} =
c_s^2\, \delta\varepsilon_\mathbf{k} + \sigma\, \delta S_\mathbf{k}$ vanishes.

Entropy perturbations can occur only in multi-component fluids. For example, in 
a fluid consisting of baryons and radiation, the baryons can be distributed inhomo- 
geneously on a homogeneous background of radiation. In such a case, the entropy, 
which is equal to the number of photons per baryon, varies from place to place. 

Thus we have found the complete set of modes — two adiabatic modes, two 
vector modes and one entropy mode — describing perturbations in a gravitating 
homogeneous non-expanding medium. The most interesting is the exponentially 
growing adiabatic mode which is responsible for the origin of structure in the 
universe. 


6.3 Instability in an expanding universe 


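Background In an expanding homogeneous and isotropic universe, the background
energy density is a function of time, and the background velocities obey the Hubble
law:

$$\varepsilon = \varepsilon_0(t), \qquad \mathbf{V} = \mathbf{V}_0 = H(t) \cdot \mathbf{x}. \tag{6.32}$$

Substituting these expressions into (6.3), we obtain the familiar equation

$$\dot\varepsilon_0 + 3H\varepsilon_0 = 0, \tag{6.33}$$

which states that the total mass of nonrelativistic matter is conserved. The divergence
of the Euler equations (6.8) together with the Poisson equation (6.10) leads
to the Friedmann equation:

$$\dot H + H^2 = -\frac{4\pi G}{3}\varepsilon_0. \tag{6.34}$$


Perturbations Ignoring entropy perturbations and substituting the expressions

$$\begin{aligned}
\varepsilon &= \varepsilon_0 + \delta\varepsilon(\mathbf{x}, t), \qquad \mathbf{V} = \mathbf{V}_0 + \delta\mathbf{v}, \qquad \phi = \phi_0 + \delta\phi, \\
p &= p_0 + \delta p = p_0 + c_s^2\, \delta\varepsilon,
\end{aligned} \tag{6.35}$$

into (6.3), (6.8), (6.10), we derive the following set of linearized equations for small
perturbations:

$$\frac{\partial \delta\varepsilon}{\partial t} + \varepsilon_0 \nabla\delta\mathbf{v} + \nabla(\delta\varepsilon \cdot \mathbf{V}_0) = 0, \tag{6.36}$$

$$\frac{\partial \delta\mathbf{v}}{\partial t} + (\mathbf{V}_0 \cdot \nabla)\delta\mathbf{v} + (\delta\mathbf{v} \cdot \nabla)\mathbf{V}_0 + \frac{c_s^2}{\varepsilon_0}\nabla\delta\varepsilon + \nabla\delta\phi = 0, \tag{6.37}$$

$$\Delta\delta\phi = 4\pi G\, \delta\varepsilon. \tag{6.38}$$

The Hubble velocity $\mathbf{V}_0$ depends explicitly on $\mathbf{x}$ and therefore the Fourier transform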
with respect to the Eulerian coordinates x does not reduce these equations to a 
decoupled set of ordinary differential equations. This is why it is more convenient 
to use the Lagrangian (comoving with the Hubble flow) coordinates q, which are 
related to the Eulerian coordinates via 


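$$\mathbf{x} = a(t)\, \mathbf{q}, \tag{6.39}$$

where $a(t)$ is the scale factor. The partial derivative with respect to time taken at
constant $\mathbf{x}$ is different from the partial derivative taken at constant $\mathbf{q}$. For a general
function $f(\mathbf{x}, t)$ we have

$$\left(\frac{\partial f(\mathbf{x} = a\mathbf{q}, t)}{\partial t}\right)_{\mathbf{q}} = \left(\frac{\partial f}{\partial t}\right)_{\mathbf{x}} + \dot a\, q^i \left(\frac{\partial f}{\partial x^i}\right)_t, \tag{6.40}$$

and therefore

$$\left(\frac{\partial}{\partial t}\right)_{\mathbf{x}} = \left(\frac{\partial}{\partial t}\right)_{\mathbf{q}} - (\mathbf{V}_0 \cdot \nabla_{\mathbf{x}}). \tag{6.41}$$

The spatial derivatives are more simply related:

$$\nabla_{\mathbf{x}} = \frac{1}{a}\nabla_{\mathbf{q}}. \tag{6.42}$$

Replacing the derivatives in (6.36)-(6.38) and introducing the fractional amplitude
of the density perturbations $\delta \equiv \delta\varepsilon/\varepsilon_0$, we finally obtain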


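$$\left(\frac{\partial \delta}{\partial t}\right) + \frac{1}{a}\nabla\delta\mathbf{v} = 0, \tag{6.43}$$

$$\left(\frac{\partial \delta\mathbf{v}}{\partial t}\right) + H\delta\mathbf{v} + \frac{c_s^2}{a}\nabla\delta + \frac{1}{a}\nabla\delta\phi = 0, \tag{6.44}$$

$$\Delta\delta\phi = 4\pi G a^2 \varepsilon_0\, \delta, \tag{6.45}$$

where $\nabla \equiv \nabla_{\mathbf{q}}$ and $\Delta$ are now the derivatives with respect to the Lagrangian coordinates
$\mathbf{q}$ and the time derivatives are taken at constant $\mathbf{q}$. In deriving (6.43) we have
used (6.33) for the background and noted that $\nabla_{\mathbf{x}} \mathbf{V}_0 = 3H$ and $(\delta\mathbf{v} \cdot \nabla_{\mathbf{x}})\mathbf{V}_0 = H\delta\mathbf{v}$.
Taking the divergence of (6.44) and using the continuity and Poisson equations to
express $\nabla\delta\mathbf{v}$ and $\Delta\delta\phi$ in terms of $\delta$, we derive the closed form equation

$$\ddot\delta + 2H\dot\delta - \frac{c_s^2}{a^2}\Delta\delta - 4\pi G \varepsilon_0\, \delta = 0, \tag{6.46}$$

which describes gravitational instability in an expanding universe.


6.3.1 Adiabatic perturbations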
Taking the Fourier transform of (6.46) with respect to the comoving coordinates $\mathbf{q}$,
we obtain the ordinary differential equation

$$\ddot\delta_\mathbf{k} + 2H\dot\delta_\mathbf{k} + \left(\frac{c_s^2 k^2}{a^2} - 4\pi G \varepsilon_0\right)\delta_\mathbf{k} = 0 \tag{6.47}$$

for every Fourier mode $\delta = \delta_\mathbf{k}(t) \exp(i\mathbf{k}\mathbf{q})$. The behavior of each perturbation
depends crucially on its spatial size; the critical lengthscale is the Jeans length

$$\lambda_J^{ph} = \frac{2\pi a}{k_J} = c_s \sqrt{\frac{\pi}{G\varepsilon_0}}. \tag{6.48}$$

Here $\lambda^{ph}$ is the physical wavelength (measured, for example, in centimeters), related
to the comoving wavelength $\lambda = 2\pi/k$ via $\lambda^{ph} = a \cdot \lambda$. In a flat, matter-dominated
universe $\varepsilon_0 = (6\pi G t^2)^{-1}$ and hence

$$\lambda_J^{ph} \sim c_s t, \tag{6.49}$$

that is, the Jeans length is of order the sound horizon. Sometimes instead of the
Jeans length, one uses the Jeans mass, defined as $M_J \equiv \varepsilon_0 (\lambda_J^{ph})^3$.

Perturbations on scales much smaller than the Jeans length ($\lambda \ll \lambda_J$) are sound
waves. If $c_s$ changes adiabatically, then the solution of (6.47) is

$$\delta_\mathbf{k} \propto \frac{1}{\sqrt{c_s a}} \exp\left(\pm i k \int \frac{c_s\, dt}{a}\right). \tag{6.50}$$


Problem 6.2 Derive the solution in (6.50). Explain why the amplitude of sound
waves decays with time. (Hint Using conformal time $\eta \equiv \int dt/a$ instead of the
physical time $t$, derive the equation for the rescaled amplitude $\sqrt{a}\, \delta_\mathbf{k}$ and solve it in
the WKB approximation.)


On scales much larger than the Jeans scale ($\lambda \gg \lambda_J$), gravity dominates and we
can neglect the $k$-dependent term in (6.47). Then one of the solutions is simply
proportional to the Hubble constant $H(t)$. In fact, substituting $\delta_d = H(t)$ in (6.47),
where we set $c_s^2 k^2 = 0$, one finds that the resulting equation coincides with the time
derivative of the Friedmann equation (6.34). Note that $\delta_d = H(t)$ is the decaying
solution of the perturbation equation ($H$ decreases with time) in a matter-dominated
universe with arbitrary curvature.

Actually, one could have guessed this solution using the following simple argument.
Both the background energy density $\varepsilon_0(t)$ and the time-shifted energy
density $\varepsilon_0(t + \tau)$, where $\tau =$ const, satisfy (6.33), (6.34). Indeed, using (6.33)
to express $H$ in terms of $\varepsilon_0$ and substituting this into (6.34), we obtain an equation
for $\varepsilon_0(t)$ in which the time does not explicitly appear. Hence its solution is
time-translational-invariant. For small $\tau$, the time-shifted solution $\varepsilon_0(t + \tau)$ can be
considered as a perturbation of the background $\varepsilon_0(t)$ with amplitude

$$\delta_d = \frac{\varepsilon_0(t + \tau) - \varepsilon_0(t)}{\varepsilon_0(t)} \approx \frac{\dot\varepsilon_0 \tau}{\varepsilon_0} \propto H(t).$$

Once we know one solution of the second order differential equation, $\delta_d$, then the
other independent solution $\delta_i$ can easily be found with the help of the Wronskian,



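$$W \equiv \dot\delta_d \delta_i - \delta_d \dot\delta_i. \tag{6.51}$$

Taking the derivative of the Wronskian and using (6.47) to express $\ddot\delta$ in terms of $\dot\delta$
and $\delta$, we find that $W$ satisfies the equation

$$\dot W = -2HW, \tag{6.52}$$

which has the obvious solution

$$W \equiv \dot\delta_d \delta_i - \delta_d \dot\delta_i = \frac{C}{a^2}, \tag{6.53}$$

where $C$ is a constant of integration. Substituting the ansatz $\delta_i = \delta_d f(t)$ into (6.53),
we obtain an equation for $f$ that is readily integrated:

$$f = -C \int \frac{dt}{a^2 \delta_d^2}. \tag{6.54}$$

Thus the most general longwave solution of (6.47) is

$$\delta = C_1 H \int \frac{dt}{a^2 H^2} + C_2 H. \tag{6.55}$$

In a flat, matter-dominated universe, $a \propto t^{2/3}$ and $H \propto t^{-1}$. In this case, we have

$$\delta = C_1 t^{2/3} + C_2 t^{-1}. \tag{6.56}$$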

Hence, we see that in an expanding universe, gravitational instability is much less 
efficient and the perturbation amplitude increases only as a power of time. In the im- 
portant case of a flat, matter-dominated universe, the growing mode is proportional 
to the scale factor. Therefore, if we want to obtain large inhomogeneities ($\delta \gtrsim 1$)
today, we have to assume that at early times (for example, at redshifts $z = 1000$) the
inhomogeneities were already substantial ($\delta \geq 10^{-3}$). This imposes rather strong
constraints on the initial spectrum of perturbations. We will see in Chapter 8 that 
the required initial spectrum can be explained naturally in inflationary cosmology. 
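
The power-law behavior in (6.56) can be checked by integrating (6.47) directly in the long-wavelength limit. The sketch below is a minimal fourth-order Runge-Kutta integration, assuming a flat, matter-dominated background in which $H = 2/(3t)$ and $4\pi G \varepsilon_0 = 2/(3t^2)$; the function names, time interval and step count are illustrative choices.

```python
def rhs(t, delta, ddelta):
    """First-order form of (6.47) with c_s k = 0 in a flat matter-dominated universe.

    With H = 2/(3t) and 4*pi*G*eps0 = 2/(3 t^2) the equation reads
    delta'' = -(4/(3t)) delta' + (2/(3 t^2)) delta.
    """
    return ddelta, -4.0 / (3.0 * t) * ddelta + 2.0 / (3.0 * t * t) * delta


def integrate(t0, t1, delta0, ddelta0, steps=20000):
    """Classical RK4 integration of the system (delta, delta') from t0 to t1."""
    h = (t1 - t0) / steps
    t, y, v = t0, delta0, ddelta0
    for _ in range(steps):
        k1y, k1v = rhs(t, y, v)
        k2y, k2v = rhs(t + h / 2, y + h / 2 * k1y, v + h / 2 * k1v)
        k3y, k3v = rhs(t + h / 2, y + h / 2 * k2y, v + h / 2 * k2v)
        k4y, k4v = rhs(t + h, y + h * k3y, v + h * k3v)
        y += h / 6 * (k1y + 2 * k2y + 2 * k3y + k4y)
        v += h / 6 * (k1v + 2 * k2v + 2 * k3v + k4v)
        t += h
    return y


# Pure growing mode: delta = t^(2/3), so delta(1) = 1 and delta'(1) = 2/3.
growing = integrate(1.0, 8.0, 1.0, 2.0 / 3.0)
# Pure decaying mode: delta = 1/t, so delta(1) = 1 and delta'(1) = -1.
decaying = integrate(1.0, 8.0, 1.0, -1.0)
# Matter initially at rest with respect to the Hubble flow: delta'(1) = 0.
at_rest = integrate(1.0, 8.0, 1.0, 0.0)

print(growing, decaying, at_rest)
```

Between $t = 1$ and $t = 8$ the growing mode should increase by $8^{2/3} = 4$ and the decaying mode should fall to $1/8$, so the amplification is only a power of the elapsed time, in contrast to the exponential growth of Jeans theory.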


Problem 6.3 Calculate the peculiar velocities and gravitational potential for the
long-wavelength perturbations. Analyze their behavior and give the physical interpretation
of the behavior of the gravitational potential for the growing mode.


Problem 6.4 If we use the redshift $z$ as a time parameter, then the integral in (6.55)
can be calculated explicitly for a matter-dominated universe with arbitrary $\Omega_0$.
Find the corresponding solution $\delta(z)$ and show that for $\Omega_0 \ll 1$ the perturbation
amplitude freezes out at $z \sim 1/\Omega_0$.


6.3.2 Vector perturbations 
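With $\delta = 0$, (6.43)-(6.45) reduce to

$$\nabla\delta\mathbf{v} = 0, \qquad \frac{\partial \delta\mathbf{v}}{\partial t} + H\delta\mathbf{v} = 0. \tag{6.57}$$

From the first equation it follows that for a plane wave perturbation, $\delta\mathbf{v} \propto
\delta\mathbf{v}_\mathbf{k}(t) \exp(i\mathbf{k}\mathbf{q})$, the peculiar velocity $\delta\mathbf{v}$ is perpendicular to the wavenumber $\mathbf{k}$.
The second equation becomes

$$\delta\dot{\mathbf{v}}_\mathbf{k} + \frac{\dot a}{a}\delta\mathbf{v}_\mathbf{k} = 0, \tag{6.58}$$

and has the obvious solution $\delta\mathbf{v}_\mathbf{k} \propto 1/a$. Thus, the vector perturbations decay as the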
universe expands. These perturbations can have significant amplitudes at present 
only if their initial amplitudes were so large that they completely spoiled the isotropy 
of the very early universe. In an inflationary universe there is no room for such large 
primordial vector perturbations and they do not play any role in the formation of 
the large-scale structure of the universe. Vector perturbations, however, can be 
generated at late times, after nonlinear structure has been formed, and can explain 
the rotation of galaxies. 


6.3.3 Self-similar solution 


For large-scale perturbations we can neglect the pressure, and the spatial derivatives 
drop out of (6.46). In this case the solution for perturbations can be written directly 
in coordinate space 


$$\delta(\mathbf{q}, t) = A(\mathbf{q})\, \delta_i(t) + B(\mathbf{q})\, \delta_d(t), \tag{6.59}$$

where $\delta_i$ and $\delta_d$ are growing and decaying modes respectively. Without losing
generality we can set $\delta_i(t_0) = \delta_d(t_0) = 1$ at some initial moment of time $t_0$. If the
density distribution at this time is described by the function $\delta(\mathbf{q}, t_0)$ and the matter
is at rest with respect to the Hubble flow ($\delta\mathbf{v} \propto \dot\delta(\mathbf{q}, t_0) = 0$), then, expressing $A(\mathbf{q})$
and $B(\mathbf{q})$ in terms of $\delta(\mathbf{q}, t_0)$, we obtain

$$\delta(\mathbf{q}, t) = \delta(\mathbf{q}, t_0) \left[ \frac{\delta_i(t)}{1 - (\dot\delta_i/\dot\delta_d)_{t_0}} + \frac{\delta_d(t)}{1 - (\dot\delta_d/\dot\delta_i)_{t_0}} \right]. \tag{6.60}$$

In this particular case the perturbation preserves its initial spatial shape as it develops.
Such a solution is said to be self-similar.

Generically the shape of inhomogeneity changes. However, at late times ($t \gg t_0$)
when the growing mode dominates, we can omit the second term in (6.59) and the 
linear perturbation grows in a self-similar way. 
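
As a worked example of (6.60), assume a flat, matter-dominated universe (a choice made here only for illustration), for which the normalized modes are $\delta_i = (t/t_0)^{2/3}$ and $\delta_d = (t/t_0)^{-1}$. Then $(\dot\delta_i/\dot\delta_d)_{t_0} = -2/3$, the two denominators in (6.60) are $5/3$ and $5/2$, and the solution becomes

```latex
\delta(\mathbf{q}, t) = \delta(\mathbf{q}, t_0)
  \left[ \frac{3}{5}\left(\frac{t}{t_0}\right)^{2/3}
       + \frac{2}{5}\left(\frac{t}{t_0}\right)^{-1} \right].
```

Three fifths of a perturbation that starts at rest with respect to the Hubble flow therefore goes into the growing mode, and the remaining two fifths decays away.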


6.3.4 Cold matter in the presence of radiation or dark energy 


There is convincing evidence that along with the cold matter in the universe there 
exists a smooth dark energy component. This dark energy changes the expansion 
rate and, as a result, influences the growth of inhomogeneities in the cold matter. 

To study the gravitational instability in the presence of relativistic matter, in prin- 
ciple we need the full relativistic theory. However, on scales smaller than the Jeans 
length for relativistic matter, which is comparable to the horizon scale, the inhomo- 
geneities in the cold matter distribution do not disturb the relativistic component 
and it remains practically homogeneous. As a result we can still apply modified 
Newtonian theory to the perturbations in the cold matter itself on scales smaller 
than the horizon. In the following we consider the growth of the perturbations in the 
presence of a homogeneous relativistic energy component. This can be radiation, 
with equation of state $w = 1/3$, or dark energy with $w < -1/3$.

It is easy to verify that the equation for the perturbation in the cold component
alone, $\delta \equiv \delta\varepsilon_d/\varepsilon_d$, coincides with (6.46), but the Hubble constant is now determined
by the total energy density

$$\varepsilon_{tot} = \frac{\varepsilon_{eq}}{2} \left[ \left(\frac{a_{eq}}{a}\right)^3 + \left(\frac{a_{eq}}{a}\right)^{3(1+w)} \right] \tag{6.61}$$

via the usual relation, which for a flat universe is

$$H^2 = \frac{8\pi G}{3}\varepsilon_{tot}. \tag{6.62}$$

Here $a_{eq}$ is the scale factor at "equality" when the energy densities of both components
are equal. To find the explicit solutions of (6.46), it is convenient to rewrite it
using as a time variable the normalized scale factor $x \equiv a/a_{eq}$ instead of $t$. Taking
into account that $\varepsilon_0$ entering (6.46) is the cold matter density alone, equal to

$$\varepsilon_d = \frac{\varepsilon_{eq}}{2} \left(\frac{a_{eq}}{a}\right)^3, \tag{6.63}$$

and using (6.62) to express the Hubble parameter in terms of $x$, (6.46) becomes

$$x^2 \left(1 + x^{-3w}\right) \frac{d^2\delta}{dx^2} + \frac{3}{2} x \left(1 + (1 - w) x^{-3w}\right) \frac{d\delta}{dx} - \frac{3}{2}\delta = 0. \tag{6.64}$$

We have skipped in (6.64) the term proportional to $c_s^2$ because it is determined by
the pressure of the cold matter alone and hence is negligible. The general solution
of (6.64) for an arbitrary $w =$ const is given by a linear combination of hypergeometric
functions. However, at least in two important cases, they reduce to simple
elementary functions.


Cosmological constant ($w = -1$) One can easily verify that in this case,

$$\delta_1(x) = \sqrt{1 + x^{-3}} \tag{6.65}$$

satisfies (6.64). The other solution can be obtained using the properties of the
Wronskian.


Problem 6.5 Verify that if $\delta_1(x)$ is a solution of (6.64), then the second independent
solution is given by

$$\delta_2(x) = \delta_1(x) \int^x \frac{dy}{y^{3/2} \left(1 + y^{-3w}\right)^{1/2} \delta_1^2(y)}. \tag{6.66}$$


For $w = -1$, the general solution of (6.64) is thus

$$\delta(x) = C_1 \sqrt{1 + x^{-3}} + C_2 \sqrt{1 + x^{-3}} \int_0^x \left(\frac{y}{1 + y^3}\right)^{3/2} dy, \tag{6.67}$$

where $C_1$ and $C_2$ are constants of integration. At early times when the cold matter
dominates ($x \ll 1$), the perturbation grows as

$$\delta(x) = C_1 x^{-3/2} + \frac{2}{5} C_2 x + O(x^{3/2}), \tag{6.68}$$

in complete agreement with our previous result. Subsequently the cosmological
constant becomes dominant and in the limit $x \gg 1$ we have

$$\delta(x) = (C_1 + I C_2) - \frac{1}{2} C_2 x^{-2} + O(x^{-3}), \tag{6.69}$$

where

$$I = \int_0^\infty \left(\frac{y}{1 + y^3}\right)^{3/2} dy \approx 0.57. \tag{6.70}$$


Thus, when the cosmological constant overtakes the matter density the growth
ceases and the amplitude of the perturbation is frozen. According to (6.45), the
induced gravitational potential decays in inverse proportion to the scale factor
since $\varepsilon_d \propto a^{-3}$. The results obtained are not surprising because the cosmological
constant acts as "antigravity" and tries to prevent the growth of the perturbations.


Problem 6.6 Verify that if $w = -1$ or $w = -1/3$, then $\delta \propto H$ is the solution
of (6.46) in the universe with arbitrary curvature. Using (6.55), find the general
solutions in these cases and analyze their behavior in open and closed universes.
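
The freeze-out of the growing mode for a cosmological constant, (6.67)-(6.70), is easy to check numerically. The sketch below is a minimal illustration that evaluates the $C_2$ term of (6.67) with a simple midpoint rule and compares the limiting value with a closed form for $I$; the helper names and the number of integration steps are assumptions made for this example. The closed form follows from the substitution $u = y^3$, which turns (6.70) into an Euler beta function, $I = B(5/6, 2/3)/3$.

```python
import math


def midpoint_integral(f, a, b, n=100000):
    """Composite midpoint rule for the integral of f over [a, b]."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))


def integrand(y):
    """Integrand of (6.67) and (6.70): (y / (1 + y^3))^(3/2)."""
    return (y / (1.0 + y ** 3)) ** 1.5


def growing_mode(x):
    """The C_2 term of (6.67), evaluated with C_2 = 1."""
    return math.sqrt(1.0 + x ** -3) * midpoint_integral(integrand, 0.0, x)


# Closed form of (6.70): I = B(5/6, 2/3) / 3 = Gamma(5/6) Gamma(2/3) / (3 Gamma(3/2)).
I_exact = math.gamma(5.0 / 6.0) * math.gamma(2.0 / 3.0) / (3.0 * math.gamma(1.5))

print(f"I = {I_exact:.4f}")
for x in (0.01, 1.0, 10.0, 100.0):
    print(f"x = {x:7.2f}  delta = {growing_mode(x):.5f}")
```

For $x \ll 1$ the output follows $2x/5$, as in (6.68), while for $x \gg 1$ it saturates at $I$, as in (6.69).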


Radiation background The Jeans length for radiation is comparable to the horizon
size because the speed of sound in the radiation component is of order the speed of
light ($c_s^2 = 1/3$). Therefore cold dark matter, which interacts only gravitationally
with radiation, does not induce significant perturbations in radiation and to study
the growth of inhomogeneities in cold matter alone we can still use (6.64), setting
$w = 1/3$. In this case,

$$\delta_1(x) = 1 + \frac{3}{2}x \tag{6.71}$$

satisfies (6.64). The other independent solution can be found by substituting (6.71)
into (6.66). The integral can be calculated explicitly and the general solution of
(6.64) is

$$\delta(x) = C_1 \left(1 + \frac{3}{2}x\right) + C_2 \left[ \left(1 + \frac{3}{2}x\right) \ln\frac{\sqrt{1 + x} + 1}{\sqrt{1 + x} - 1} - 3\sqrt{1 + x} \right]. \tag{6.72}$$

At early times, during the radiation-dominated stage ($x \ll 1$), the amplitude of
perturbations grows as

$$\delta(x) = (C_1 - 3C_2) - C_2 \ln(x/4) + O(x), \tag{6.73}$$

that is, logarithmically at most. Thus, by influencing the rate of expansion, the
radiation suppresses the growth of inhomogeneities in the cold component. After
matter-radiation equality, matter overtakes radiation and at $x \gg 1$, the amplitude
of perturbation is

$$\delta(x) = C_1 \left(1 + \frac{3}{2}x\right) + \frac{4}{15} C_2 x^{-3/2} + O(x^{-5/2}), \tag{6.74}$$
that is, it grows proportionally to the scale factor. Since the perturbations cannot 
be amplified significantly during the radiation epoch, small initial perturbations 
can produce nonlinear structure only if the cold matter starts to dominate early 
enough. This imposes a lower bound on the amount of cold matter. In particular,
if the amplitude of the initial inhomogeneities is of order $10^{-4}$, as favored by the
observed CMB fluctuations, we can explain the nonlinear structure seen only if the
initial perturbations started to grow before recombination. This is possible if there
exists a cold dark matter component of non-baryonic origin, which interacts only
gravitationally with radiation. We can reconcile the small initial perturbations with
the observed large-scale structure only if this dark matter constitutes about 30% of
the present critical density. It is clear that baryons cannot substantially contribute
to the dark matter. They are tightly coupled to radiation before recombination and
perturbations in the baryon component can start to grow only after recombination, 
when the dark matter must already have begun to cluster. Furthermore, a high 
baryon density would spoil nucleosynthesis. 
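
The solutions (6.71)-(6.74) for cold matter on a radiation background can be checked numerically. The following minimal sketch substitutes both solutions into (6.64) with $w = 1/3$, using central finite differences, and then compares the second solution with its early- and late-time limits; the function names and the relative step size are illustrative assumptions.

```python
import math


def delta_1(x):
    """Growing solution (6.71): 1 + 3x/2."""
    return 1.0 + 1.5 * x


def delta_2(x):
    """Second solution, the C_2 term in (6.72)."""
    s = math.sqrt(1.0 + x)
    return (1.0 + 1.5 * x) * math.log((s + 1.0) / (s - 1.0)) - 3.0 * s


def residual(f, x, rel_step=1e-3):
    """Left-hand side of (6.64) for w = 1/3, via central finite differences.

    For w = 1/3, (6.64) reads x (1 + x) f'' + (1 + 3x/2) f' - (3/2) f = 0.
    """
    h = rel_step * x
    d1 = (f(x + h) - f(x - h)) / (2.0 * h)
    d2 = (f(x + h) - 2.0 * f(x) + f(x - h)) / (h * h)
    return x * (1.0 + x) * d2 + (1.0 + 1.5 * x) * d1 - 1.5 * f(x)


for x in (0.1, 1.0, 10.0):
    print(x, residual(delta_1, x), residual(delta_2, x))

# Early-time limit (6.73) with C_1 = 0, C_2 = 1: delta ~ -3 - ln(x/4).
x_small = 1e-6
print(delta_2(x_small), -3.0 - math.log(x_small / 4.0))

# Late-time limit (6.74): the second solution decays as (4/15) x^(-3/2).
x_large = 1e3
print(delta_2(x_large), 4.0 / 15.0 * x_large ** -1.5)
```

The residuals should vanish to within the finite-difference error, and the two limiting forms should agree with the exact expression up to the neglected higher-order terms.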


6.4 Beyond linear approximation 


The Hubble flow stretches linear inhomogeneities; their spatial size is proportional
to the scale factor. The relative amplitude of the linear perturbation grows, while
its energy density, equal to

$$\varepsilon = \varepsilon_0 \left(1 + \delta + O(\delta^2)\right),$$

decays only slightly more slowly than the background energy density $\varepsilon_0$. It is
obvious that when the perturbation amplitude reaches unity ($\delta \sim 1$), the neglected
nonlinear terms $\sim \delta^2$ etc., become important. At this time the gravitational field
created by the perturbation leads to a contraction which overwhelms the Hubble 
expansion. As a result the inhomogeneity drops out of the Hubble flow, reaches its 
maximal size and recollapses to form a stable nonlinear structure. 

Even for pressureless matter, exact solutions describing nonlinear evolution can 
be obtained only in a few particular cases where the spatial shape of the inho- 
mogeneity possesses a special symmetry. To build intuition about the nonlinear 
behavior of perturbations we will derive exact solutions in two special cases: for 
a spherically symmetric perturbation and for an anisotropic one-dimensional in- 
homogeneity. The behavior of realistic nonsymmetric perturbations can then be 
qualitatively understood on the basis of these two limiting cases. 

Let us first recast the hydrodynamical equations (6.3), (6.8), (6.10) in a slightly 
different form, which is more convenient for finding their nonlinear solutions. The 
continuity equation (6.3) can be written as

$$\left(\frac{\partial}{\partial t} + V^i \nabla_i\right)\varepsilon + \varepsilon \nabla_i V^i = 0, \tag{6.75}$$

where $\nabla_i \equiv \partial/\partial x^i$. Taking the divergence of the Euler equations (6.8) and using
the Poisson equation (6.10), we obtain

$$\left(\frac{\partial}{\partial t} + V^i \nabla_i\right)\left(\nabla_j V^j\right) + \left(\nabla_j V^i\right)\left(\nabla_i V^j\right) + 4\pi G \varepsilon = 0, \tag{6.76}$$

where we have assumed that the pressure is equal to zero.
In the next step we replace the Eulerian coordinates $\mathbf{x}$ with the comoving Lagrangian
coordinates $\mathbf{q}$, enumerating the matter elements:

$$\mathbf{x} = \mathbf{x}(\mathbf{q}, t). \tag{6.77}$$

These new coordinates can be used until the trajectories of the matter elements
start to cross one another. The velocity of a matter element with the given Lagrangian
coordinates $\mathbf{q}$ is equal to

$$V^i \equiv \frac{dx^i}{dt} = \left.\frac{\partial x^i(\mathbf{q}, t)}{\partial t}\right|_{\mathbf{q}}. \tag{6.78}$$

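The derivatives of the velocity field with respect to the Eulerian coordinates can
then be written as

$$\nabla_j V^i = \frac{\partial q^k}{\partial x^j} \frac{\partial}{\partial q^k}\left(\frac{\partial x^i(\mathbf{q}, t)}{\partial t}\right) = \frac{\partial q^k}{\partial x^j} \frac{\partial J^i_k}{\partial t}, \tag{6.79}$$

where we have introduced the strain tensor

$$J^i_k(\mathbf{q}, t) \equiv \frac{\partial x^i(\mathbf{q}, t)}{\partial q^k}. \tag{6.80}$$

Taking into account that

$$\left.\left(\frac{\partial}{\partial t} + V^i \nabla_i\right)\right|_{\mathbf{x}} = \left.\frac{\partial}{\partial t}\right|_{\mathbf{x}} + \left.\frac{\partial x^i(\mathbf{q}, t)}{\partial t}\right|_{\mathbf{q}} \frac{\partial}{\partial x^i} = \left.\frac{\partial}{\partial t}\right|_{\mathbf{q}}, \tag{6.81}$$

and substituting (6.79) into (6.75) and (6.76), we obtain

$$\frac{\partial \varepsilon}{\partial t} + \varepsilon \frac{\partial q^k}{\partial x^i} \frac{\partial J^i_k}{\partial t} = 0, \tag{6.82}$$

$$\frac{\partial}{\partial t}\left(\frac{\partial q^k}{\partial x^i} \frac{\partial J^i_k}{\partial t}\right) + \left(\frac{\partial q^k}{\partial x^j} \frac{\partial J^i_k}{\partial t}\right)\left(\frac{\partial q^l}{\partial x^i} \frac{\partial J^j_l}{\partial t}\right) + 4\pi G \varepsilon = 0, \tag{6.83}$$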


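where the time derivatives are taken at constant $\mathbf{q}$. The elements of the strain tensor
(6.80) form a $3 \times 3$ matrix $\mathbf{J} \equiv \|J^i_k\|$, and since

$$\frac{\partial q^k}{\partial x^j} \frac{\partial x^i}{\partial q^k} = \frac{\partial x^i}{\partial x^j} = \delta^i_j, \tag{6.84}$$

the derivatives $\partial q^k/\partial x^i$ are the elements of the inverse matrix $\mathbf{J}^{-1}$. Consequently,
we can rewrite (6.82) and (6.83) in matrix notation as

$$\dot\varepsilon + \varepsilon\, \mathrm{tr}\left(\dot{\mathbf{J}} \cdot \mathbf{J}^{-1}\right) = 0, \tag{6.85}$$

$$\left[\mathrm{tr}\left(\dot{\mathbf{J}} \cdot \mathbf{J}^{-1}\right)\right]^{\cdot} + \mathrm{tr}\left[\left(\dot{\mathbf{J}} \cdot \mathbf{J}^{-1}\right)^2\right] + 4\pi G \varepsilon = 0, \tag{6.86}$$

where a dot denotes the partial time derivative.


Problem 6.7 Prove that

$$\mathrm{tr}\left(\dot{\mathbf{J}} \cdot \mathbf{J}^{-1}\right) = (\ln J)^{\cdot}, \tag{6.87}$$

where $J(\mathbf{q}, t) \equiv \det \mathbf{J}$.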


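After substitution of (6.87) into (6.85), the resulting equation can easily be
integrated to give

$$\varepsilon(\mathbf{q}, t) = \frac{\varrho_0(\mathbf{q})}{J(\mathbf{q}, t)}, \tag{6.88}$$

where $\varrho_0(\mathbf{q})$ is an arbitrary time-independent function of the Lagrangian coordinates.
With (6.87) and (6.88), (6.86) simplifies to

$$(\ln J)^{\cdot\cdot} + \mathrm{tr}\left[\left(\dot{\mathbf{J}} \cdot \mathbf{J}^{-1}\right)^2\right] + 4\pi G \varrho_0 J^{-1} = 0. \tag{6.89}$$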


This resulting equation for J can be solved exactly for a few interesting cases. 


6.4.1 Tolman solution 


Let us consider a spherically symmetric inhomogeneity. In this case one can always 
find a coordinate system where the strain tensor is proportional to the unit tensor: 


$$J^i_k = a(R, t)\, \delta^i_k, \tag{6.90}$$

where $R \equiv |\mathbf{q}|$ is the radial Lagrangian coordinate. Substituting

$$J = a^3, \qquad \mathrm{tr}\left[\left(\dot{\mathbf{J}} \cdot \mathbf{J}^{-1}\right)^2\right] = 3\left(\frac{\dot a}{a}\right)^2 \tag{6.91}$$

into (6.89), we obtain

$$\ddot a(R, t) = -\frac{4\pi G \varrho_0(R)}{3a^2(R, t)}. \tag{6.92}$$

Multiplying this equation by $\dot a$, we easily derive its first integral

$$\dot a^2(R, t) - \frac{8\pi G \varrho_0(R)}{3a(R, t)} = F(R), \tag{6.93}$$

where $F(R)$ is a constant of integration. Note that for a homogeneous matter distribution
$\varrho_0$, $a$ and $F$ do not depend on $R$ and (6.93) coincides with the Friedmann
equation for a matter-dominated universe.


Problem 6.8 Verify that the solution of (6.93) can be written in the following
parametric form:

$$a(R, \eta) = \frac{4\pi G \varrho_0}{3|F|}(1 - \cos\eta), \qquad t(R, \eta) = \frac{4\pi G \varrho_0}{3|F|^{3/2}}(\eta - \sin\eta) + t_0, \tag{6.94}$$

for $F < 0$, and

$$a(R, \eta) = \frac{4\pi G \varrho_0}{3F}(\cosh\eta - 1), \qquad t(R, \eta) = \frac{4\pi G \varrho_0}{3F^{3/2}}(\sinh\eta - \eta) + t_0, \tag{6.95}$$

for $F > 0$. Here $t_0 \equiv t_0(R)$ is a further integration constant. Note that the same
"conformal time" $\eta$ generally corresponds to different values of physical time $t$
for different $R$. Assuming that the initial singularity ($a \to 0$) occurs at the same
moment of physical time $t = 0$ everywhere in space, we can set $t_0 = 0$.


Let us consider the evolution of a spherically symmetric overdense region in a
flat, matter-dominated universe. Far away from the center of this region the matter
remains undisturbed and hence $\varrho_0(R \to \infty) \to \varrho_\infty =$ const. The condition of
flatness requires $F \to 0$ as $R \to \infty$. Taking the limit $|F| \to 0$ so that the ratio
$\eta/\sqrt{|F|}$ remains fixed, we immediately obtain from (6.94)

$$a(R \to \infty, t) = (6\pi G \varrho_\infty)^{1/3}\, t^{2/3}. \tag{6.96}$$

The energy density is consequently

$$\varepsilon(R \to \infty, t) = \frac{\varrho_\infty}{a^3} = \frac{1}{6\pi G t^2}, \tag{6.97}$$

in complete agreement with what one would expect for a flat dust-dominated universe.
Inside the overdense region, $F$ is negative and the energy density does not
continually decrease. Because $\varepsilon \propto a^{-3}$, the density at some point $R$ takes its minimal
value $\varepsilon_m$ when $a(R, t)$ reaches its maximal value

$$a_m = \frac{8\pi G \varrho_0}{3|F|} \tag{6.98}$$

at $\eta = \pi$ (see (6.94)). This happens at the moment of physical time

$$t_m = \frac{4\pi^2 G \varrho_0}{3|F|^{3/2}}, \tag{6.99}$$

when the energy density is equal to

$$\varepsilon_m = \frac{\varrho_0(R)}{a_m^3(R)} = \frac{27|F|^3}{(8\pi G)^3 \varrho_0^2} = \frac{3\pi}{32 G t_m^2}. \tag{6.100}$$

Comparing this result with the averaged density at $t = t_m$, given by (6.97), we find
that in those places where the energy density exceeds the averaged density by a
factor of

$$\frac{\varepsilon_m}{\varepsilon(R \to \infty)} = \frac{9\pi^2}{16} \approx 5.55, \tag{6.101}$$

the matter detaches from the Hubble flow and begins to collapse.

Formally the energy density becomes infinite at $t = 2t_m$; in reality, however,
this does not happen because there always exist deviations from exact spherical 
symmetry. As a result a spherical cloud of particles virializes and forms a stationary 
spherical object. 


Problem 6.9 Consider a homogeneous spherical cloud of particles at rest and, 
using the virial theorem, verify that after virialization its size is halved. Assuming 
that virialization is completed at $t = 2t_m$, compare the density inside the cloud with
the average density in the universe at this time. (Hint The virial theorem states 
that at equilibrium, U = —2K, where U and K are the total potential and kinetic 
energies respectively.) 


Problem 6.10 Assuming that $\eta \ll 1$ and expanding the expressions in (6.94) in
powers of $\eta$, derive the following expansion for the energy density in powers of
$(t/t_m)^{2/3} \ll 1$:

$$\varepsilon = \frac{1}{6\pi G t^2} \left[ 1 + \frac{3}{20}\left(\frac{6\pi t}{t_m}\right)^{2/3} + O\left(\left(\frac{t}{t_m}\right)^{4/3}\right) \right], \tag{6.102}$$

where $t_m$ is defined in (6.99). The second term inside the square brackets is obviously
the amplitude of the linear perturbation $\delta$. Thus, when the actual density exceeds
the averaged density by a factor of 5.5, according to the linearized theory

$$\delta(t_m) = 3(6\pi)^{2/3}/20 \approx 1.06.$$

Later, at $t = 2t_m$, the Tolman solution formally gives $\varepsilon \to \infty$, while the linear
perturbation theory predicts $\delta(2t_m) \approx 1.69$.
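
The numbers quoted for the Tolman solution can be reproduced from the parametric form (6.94). Dividing $\varepsilon = \varrho_0/a^3$ by the background density (6.97) with $t_0 = 0$ gives the exact density ratio $\frac{9}{2}(\eta - \sin\eta)^2/(1 - \cos\eta)^3$, while (6.94) and (6.99) give $t/t_m = (\eta - \sin\eta)/\pi$. The minimal sketch below evaluates both this ratio and the linear amplitude from (6.102); the function names are illustrative.

```python
import math


def overdensity(eta):
    """Exact ratio of the local density to the flat-background density.

    Follows from (6.94), (6.97) and eps = rho_0 / a^3 with t_0 = 0:
    eps / eps_background = (9/2) (eta - sin eta)^2 / (1 - cos eta)^3.
    """
    return 4.5 * (eta - math.sin(eta)) ** 2 / (1.0 - math.cos(eta)) ** 3


def time_ratio(eta):
    """t / t_m from (6.94) and (6.99): (eta - sin eta) / pi."""
    return (eta - math.sin(eta)) / math.pi


def linear_delta(eta):
    """Linear-theory amplitude from (6.102): (3/20) (6 pi t / t_m)^(2/3)."""
    return 0.15 * (6.0 * math.pi * time_ratio(eta)) ** (2.0 / 3.0)


turnaround = math.pi        # a reaches its maximum, t = t_m
collapse = 2.0 * math.pi    # a returns to zero, t = 2 t_m

print(f"exact overdensity at turnaround: {overdensity(turnaround):.3f}")
print(f"linear delta at turnaround:      {linear_delta(turnaround):.3f}")
print(f"linear delta at collapse:        {linear_delta(collapse):.3f}")
print(f"small eta check: {overdensity(0.1) - 1.0:.6f} vs {linear_delta(0.1):.6f}")
```

The last line confirms that for $\eta \ll 1$ the exact overdensity minus one agrees with the linear amplitude, as (6.102) requires.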


6.4.2 Zel’dovich solution 


The geometrical shapes of realistic inhomogeneities are typically far from spher- 
ical and their collapse is strongly anisotropic. To build intuition about the main 
features of anisotropic collapse we consider the Zel’dovich solution. This solution 
describes the nonlinear behavior of a one-dimensional perturbation, superimposed 
on three-dimensional Hubble flow. In this case the relation between the Eulerian 
and Lagrangian coordinates can be written as 


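$$x^i = a(t)\left(q^i - f^i(q^j, t)\right). \tag{6.103}$$

If we ignore vector perturbations then $f^i = \partial\psi/\partial q^i$, where $\psi$ is the potential for
the peculiar velocities. For a one-dimensional perturbation, $\psi$ depends only on one
of the coordinates, say $q^1$. Then the strain tensor takes the form

$$\mathbf{J} = a(t)\begin{pmatrix} 1 - \lambda(q^1, t) & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}, \tag{6.104}$$

and hence

$$J = a^3(1 - \lambda), \qquad \mathrm{tr}\left[\left(\dot{\mathbf{J}}\cdot\mathbf{J}^{-1}\right)^2\right] = \left(H - \frac{\dot\lambda}{1 - \lambda}\right)^2 + 2H^2, \tag{6.105}$$

where $\lambda(q^1, t) \equiv \partial f^1/\partial q^1$. Substituting (6.105) into (6.89), we find that for
$\varrho_0(\mathbf{q}) = \text{const}$ this equation reduces to two independent equations:

$$\dot H + H^2 = -\frac{4\pi G}{3}\varepsilon_0, \tag{6.106}$$

$$\ddot\lambda + 2H\dot\lambda - 4\pi G\varepsilon_0\lambda = 0, \tag{6.107}$$

where $\varepsilon_0(t) \equiv \varrho_0/a^3$. The first equation is the familiar Friedmann equation for the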
homogeneous background. The second equation coincides with (6.46) for linear 
perturbations in pressureless matter. However, it must be stressed that in deriving 
(6.107) we did not assume that the perturbations were small, and hence its solutions 
are valid in both the linear and the nonlinear regime. 
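
The separation into a background equation and an equation for $\lambda$ can be made
explicit. The following sketch assumes that (6.89) has the form
$[\mathrm{tr}(\dot{\mathbf J}\cdot\mathbf J^{-1})]^{\cdot} + \mathrm{tr}[(\dot{\mathbf J}\cdot\mathbf J^{-1})^2] = -4\pi G\varrho_0/J$;
that equation lies outside this passage, so its form is an assumption here, chosen
because it reproduces the two equations quoted above.

```latex
\begin{aligned}
\dot{\mathbf J}\cdot\mathbf J^{-1}
  &= \mathrm{diag}\!\left(H - \frac{\dot\lambda}{1-\lambda},\; H,\; H\right), \\
\left[\mathrm{tr}\,(\dot{\mathbf J}\cdot\mathbf J^{-1})\right]^{\cdot}
  + \mathrm{tr}\left[(\dot{\mathbf J}\cdot\mathbf J^{-1})^2\right]
  &= 3\,(\dot H + H^2) - \frac{\ddot\lambda + 2H\dot\lambda}{1-\lambda}
   = -\frac{4\pi G\varepsilon_0}{1-\lambda}, \\
\Rightarrow\quad
3\,(\dot H + H^2) + 4\pi G\varepsilon_0
  &= \ddot\lambda + 2H\dot\lambda + 3\,(\dot H + H^2)\,\lambda .
\end{aligned}
```

The unperturbed background ($\lambda = 0$) must satisfy this relation by itself, so the left
hand side vanishes. Using $3(\dot H + H^2) = -4\pi G\varepsilon_0$ on the right hand side then
gives the equation for $\lambda$ with no small-$\lambda$ expansion anywhere.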

According to (6.88), the energy density is equal to

$$\varepsilon(\mathbf{q}, t) = \frac{\varepsilon_0(t)}{1 - \lambda(q^1, t)}, \tag{6.108}$$

and $\lambda(q^1, t)$ can be written as (see (6.59))

$$\lambda(q^1, t) = \alpha(q^1)\,\delta_i(t) + \kappa(q^1)\,\delta_d(t). \tag{6.109}$$

Here $\delta_i(t)$ and $\delta_d(t)$ are the growing and decaying modes from the linearized theory.
For example, in a flat matter-dominated universe $\delta_i \propto t^{2/3}$ and $\delta_d \propto t^{-1}$. For
$\lambda(q^1, t) \ll 1$, the exact solution in (6.108) obviously reproduces the results of the
linearized theory.


Problem 6.11 How must (6.89) be modified in the presence of a homogeneous
relativistic component? Find and analyze the corresponding Zel’dovich solutions
in a flat universe with a cosmological constant.


The decaying mode soon becomes negligible and does not influence the evolution
even in the nonlinear phase. Ignoring this mode we have

$$\varepsilon(\mathbf{q}, t) = \frac{\varepsilon_0(t)}{1 - \alpha(q^1)\,\delta_i(t)}. \tag{6.110}$$

In those places where $\alpha(q^1)$ is positive, the energy density exceeds the averaged
density $\varepsilon_0(t)$ and the relative density contrast grows. However, during the linear
stage, when $\alpha\delta_i \ll 1$, the energy density itself decays. Only after the perturbation
enters the nonlinear regime ($\alpha\delta_i \sim 1$) does the inhomogeneous region drop out of
the Hubble flow and start to collapse. To estimate when this turnaround happens
we have to find when $\dot\varepsilon(\mathbf{q}, t) = 0$. The time derivative of the expression in (6.110)
vanishes when

$$\frac{\varepsilon}{\varepsilon_0(t)} = 1 + \frac{3H}{(\ln\delta_i)^{\cdot}}. \tag{6.111}$$

In a flat matter-dominated universe, $\delta_i \propto a$ and, according to (6.111), as soon as the
energy density exceeds the averaged density by a factor of 4, the region detaches
from the Hubble flow and begins to collapse (compare to (6.101)). The collapse is
one-dimensional and produces a two-dimensional structure known as a Zel’dovich
“pancake.” According to (6.110), at some moment of time the energy density of
the pancake becomes infinite. However, in contrast with spherical collapse, the
gravitational force and velocities at this moment remain finite. Once the matter
trajectories cross, the solution in (6.110) becomes invalid.

In places where $\alpha(q^1)$ is negative, the energy density always decreases. Matter
“escapes” from these regions and they eventually become empty.
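
The factor of 4 can be confirmed by locating the minimum of the density numerically.
The sketch below assumes a flat matter-dominated background, $\varepsilon_0 \propto t^{-2}$, and
a growing mode $\delta_i \propto t^n$, in units where $\alpha\delta_i = t^n$; the function name
and the ternary search are illustrative choices. For this background $H = 2/(3t)$ and
$(\ln\delta_i)^{\cdot} = n/t$, so the analytic expectation is $1 + 2/n$.

```python
def turnaround_contrast(n, iterations=200):
    """eps/eps0 at the moment when d(eps)/dt = 0, for alpha*delta_i = t**n."""
    def eps(t):
        # eps0 = t**-2 (flat matter-dominated), divided by (1 - alpha*delta_i)
        return t ** -2 / (1.0 - t ** n)

    lo, hi = 1e-6, 1.0 - 1e-9
    for _ in range(iterations):
        # ternary search: eps(t) has a single minimum on (0, 1)
        m1 = lo + (hi - lo) / 3.0
        m2 = hi - (hi - lo) / 3.0
        if eps(m1) < eps(m2):
            hi = m2
        else:
            lo = m1
    t_turn = 0.5 * (lo + hi)
    return 1.0 / (1.0 - t_turn ** n)

print(turnaround_contrast(2.0 / 3.0))  # delta_i proportional to a, i.e. t^(2/3)
```

With $n = 2/3$ the search returns a contrast of 4 to within the numerical tolerance,
below the value 5.55 found for spherical collapse.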

In reality, the situation is more complicated because a typical inhomogeneity is
neither spherical nor one-dimensional. To describe the evolution of a perturbation
with an arbitrary shape, Zel’dovich suggested generalizing the solution in (6.110)
to

$$\varepsilon(\mathbf{q}, t) = \frac{\varepsilon_0(t)}{(1 - \alpha\delta_i(t))(1 - \beta\delta_i(t))(1 - \gamma\delta_i(t))}, \tag{6.112}$$

where $\alpha$, $\beta$, $\gamma$ characterize the deformation along the three principal axes of the
strain tensor and they now depend on all coordinates $q^i$. The corresponding strain
tensor

$$\mathbf{J} = a\mathbf{I} - a\delta_i\begin{pmatrix} \alpha & 0 & 0 \\ 0 & \beta & 0 \\ 0 & 0 & \gamma \end{pmatrix}, \tag{6.113}$$

where $\mathbf{I}$ is the unit matrix, satisfies (6.89) only to leading order. Therefore, the
approximate solution in (6.112) has a very limited range of applicability. In fact,
substituting (6.113) into (6.86), we find

$$\varepsilon(\mathbf{q}, t) = \frac{\varepsilon_0\left[1 - \left((\alpha\beta + \alpha\gamma + \beta\gamma)\,\delta_i^2 - 2\alpha\beta\gamma\,\delta_i^3\right)\right]}{(1 - \alpha\delta_i)(1 - \beta\delta_i)(1 - \gamma\delta_i)}. \tag{6.114}$$

On the other hand, it follows from (6.88) that $\varepsilon$ should be given by the expression
in (6.112). Hence the expected error of the Zel’dovich approximation is of order the
disagreement between the results in (6.112) and (6.114), that is,
$\sim O\left((\alpha\beta + \alpha\gamma + \beta\gamma)\,\delta_i^2,\; \alpha\beta\gamma\,\delta_i^3\right)$. When the perturbations are small,
$\alpha\delta_i, \beta\delta_i, \gamma\delta_i \ll 1$, the Zel’dovich approximation reproduces the results of the
linear perturbation theory. However, in the nonlinear regime, it is not very reliable. If,
for example, $\alpha \gg \beta, \gamma$, then we can trust only the leading term, given by (6.110),
which describes one-dimensional contraction along the $\alpha$-axis. The linear corrections
$\beta\delta_i, \gamma\delta_i \ll 1$ become unreliable when $\alpha\delta_i$ reaches a value of order unity. If $\alpha \sim \beta$,
then the formula in (6.112) fails to reproduce even the basic feature of the nonlinear
collapse.
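
To get a feeling for the size of this disagreement, the two expressions can be compared
directly. The sketch below evaluates $\varepsilon/\varepsilon_0$ from (6.112) and from (6.114) for given
$\alpha$, $\beta$, $\gamma$ and $\delta_i$; the function names and the sample values are illustrative
only.

```python
def density_6_112(alpha, beta, gamma, delta):
    """eps/eps0 from the Zel'dovich formula (6.112)."""
    return 1.0 / ((1 - alpha * delta) * (1 - beta * delta) * (1 - gamma * delta))

def density_6_114(alpha, beta, gamma, delta):
    """eps/eps0 from (6.114): same denominator, corrected numerator."""
    pairs = alpha * beta + alpha * gamma + beta * gamma
    numerator = 1.0 - (pairs * delta ** 2 - 2 * alpha * beta * gamma * delta ** 3)
    return numerator * density_6_112(alpha, beta, gamma, delta)

def relative_disagreement(alpha, beta, gamma, delta):
    """(6.114)/(6.112) - 1; zero when the two expressions agree."""
    return density_6_114(alpha, beta, gamma, delta) / density_6_112(alpha, beta, gamma, delta) - 1.0

print(relative_disagreement(0.9, 0.0, 0.0, 1.0))   # one-dimensional case
print(relative_disagreement(0.3, 0.3, 0.3, 1.0))   # nearly spherical case
```

For a strictly one-dimensional perturbation the two formulas coincide, while for
comparable $\alpha$, $\beta$, $\gamma$ the mismatch is already sizeable before any of the factors
in the denominator vanishes.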


6.4.3 Cosmic web 


The strain tensor is very useful when we try to understand the nonlinear large-scale
structure of the universe. The initial inhomogeneities can be characterized
completely by the strain tensor, or equivalently by three functions $\alpha(q^i)$, $\beta(q^i)$,
$\gamma(q^i)$. How an inhomogeneity will grow in a particular region depends on the
relation between the values of $\alpha$, $\beta$, $\gamma$. Based on the results above, we see that
the collapse is one-dimensional and produces two-dimensional pancakes (walls)
in those regions where $\alpha \gg \beta, \gamma$. In the places where $\alpha \sim \beta \gg \gamma$, we expect
two-dimensional collapse, leading to formation of one-dimensional filaments. For
$\alpha \sim \beta \sim \gamma$ the collapse is nearly spherical.

For initial Gaussian perturbations the probability distribution of the strain tensor 
eigenvalues can be calculated exactly. A helpful lower dimensional visualization 
of the initial density field is a mountainous landscape in which the mountain peaks 
represent local maxima in the density and valleys correspond to local minima. In 
the concordance model (cold dark matter plus inflationary perturbation spectrum),
inhomogeneities with significant amplitudes are present on nearly all scales and
hence mountains with nearly all base sizes are superimposed on each other. If we
are interested in the structure on scales exceeding some particular size, we have to 
smear the inhomogeneities on smaller scales, in other words, remove the mountains 
with small base sizes. 

The first nonlinear structures are obviously formed near the tallest peaks where
the energy density takes its maximal value. Typically, the two curvature scales are
comparable near the mountain peak. Therefore, we expect that at a peak in the
density, which is a higher-dimensional analog of a mountain peak, $\alpha \sim \beta \sim \gamma$.
Hence the surrounding region collapses in a nearly spherical way resulting in a
spherical or somewhat elliptical object. Usually neighbouring peaks are connected
by a saddleback, which is lower in height than the peaks. Along the saddleback
one curvature scale is significantly smaller than the other. Similarly, density peaks
are connected by higher-dimensional saddlebacks, where $\alpha \sim \beta$ and $\gamma \ll \alpha, \beta$. The
collapse in these regions is two-dimensional and produces filaments. The filaments
connect the spherical objects formed earlier, resulting in a web-like structure (see
Figure 6.1). There are also regions that have no analog in the lower-dimensional
landscape, where only one of the eigenvalues, $\alpha$ say, has a local maximum and
$\beta, \gamma \ll \alpha$. In these regions the collapse is one-dimensional and walls (pancakes)
are formed. The walls connect the filaments. It is clear that in those places where
$\alpha$, $\beta$, $\gamma$ are all negative (valleys) the expansion will continue forever. Matter will
be diluted in these regions and empty voids, occupying most of the volume, will be
produced.

Over time these anisotropic structures simply become elements of the structure 
forming on even larger scales. Finally the anisotropic substructure of the larger 
structural units becomes virialized and disappears. That is, the web of filaments 
and walls is only an intermediate stage of evolution between an initial, nearly 
homogeneous state and a final, virialized isotropic structure. 

At present we observe filaments and pancakes on scales which entered the non- 
linear stage not very long ago. Therefore their average density is only a few times 
greater than the average density in the universe. The filaments connect quasi- 
spherical structures, in which the density is larger than in filaments. 

Baryons fall into the potential wells created by the cold dark matter and form 
luminous galaxies. Most of the galaxies are concentrated in the quasi-spherical 
inhomogeneities corresponding to clusters and superclusters of galaxies with scales 
of the order of a few Mpc. Some of them reside in filaments which have scales of 10-30
Mpc, and others lie in pancakes.

The complete quantitative theory of the structure formation is a challenging 
numerical problem and represents a frontier of current research. 


7


Gravitational instability in General Relativity 


The Newtonian analysis of gravitational instability has limitations. It clearly fails 
for perturbations on scales larger than the Hubble radius. In the case of a relativistic 
fluid we have to use General Relativity for both short-wavelength and long-wavelength
perturbations. This theory gives us a unified description for any matter on
all scales. Unfortunately, the physical interpretation of the results obtained is less 
transparent in General Relativity than in Newtonian theory. The main problem 
is the freedom in the choice of coordinates used to describe the perturbations. In 
contrast to the homogeneous and isotropic universe, where the preferable coordinate 
system is fixed by the symmetry properties of the background, there are no obvious 
preferable coordinates for analyzing perturbations. The freedom in the coordinate 
choice, or gauge freedom, leads to the appearance of fictitious perturbation modes. 
These fictitious modes do not describe any real inhomogeneities, but reflect only 
the properties of the coordinate system used. 

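To demonstrate this point let us consider an undisturbed homogeneous isotropic
universe, where $\varepsilon(\mathbf{x}, t) = \varepsilon(t)$. In General Relativity any coordinate system is
allowed, and we can in principle decide to use a “new” time coordinate $\tilde t$, related to
the “old” time $t$ via $\tilde t = t + \delta t(\mathbf{x}, t)$. Then the energy density
$\tilde\varepsilon(\tilde t, \mathbf{x}) \equiv \varepsilon(t(\tilde t, \mathbf{x}))$ on the hypersurface $\tilde t = \text{const}$ depends, in general, on
the spatial coordinates $\mathbf{x}$ (Figure 7.1). Assuming that $\delta t \ll t$, we have

$$\varepsilon(t) = \varepsilon\left(\tilde t - \delta t(\mathbf{x}, t)\right) \simeq \varepsilon(\tilde t) - \frac{\partial\varepsilon}{\partial t}\,\delta t \equiv \varepsilon(\tilde t) + \delta\varepsilon(\mathbf{x}, \tilde t). \tag{7.1}$$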


The first term on the right hand side must be interpreted as the background en- 
ergy density in the new coordinate system, while the second describes a linear 
perturbation. This perturbation is nonphysical and entirely due to the choice of the 
new “disturbed” time. Thus we can “produce” fictitious perturbations simply by 
perturbing the coordinates. Moreover, we can “remove” a real perturbation in the
energy density by choosing the hypersurfaces of constant time to be the same as
the hypersurfaces of constant energy: in this case $\delta\varepsilon = 0$ in spite of the presence
of the real inhomogeneities.

Fig. 7.1. Hypersurfaces $t = \text{const}$ (where $\varepsilon = \text{const}$) and $\tilde t = \text{const}$ (where $\varepsilon \neq \text{const}$).
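
The size of such a coordinate-induced perturbation is easy to illustrate numerically.
The sketch below assumes, purely for illustration, the flat matter-dominated
background $\varepsilon(t) = 1/(6\pi G t^2)$ of (6.97) and a small shift $\delta t$ at one point. It
compares the exact density on the shifted hypersurface with the linear estimate
$-(\partial\varepsilon/\partial t)\,\delta t$ from (7.1). The function names and the units $G = 1$ are assumptions
of the example.

```python
import math

def eps_background(t, G=1.0):
    """Homogeneous energy density 1/(6 pi G t^2), used here as an example background."""
    return 1.0 / (6.0 * math.pi * G * t * t)

def fictitious_perturbation(t_new, dt, G=1.0):
    """Linear estimate of delta-eps from (7.1): -(d eps/dt) * dt."""
    deps_dt = -2.0 * eps_background(t_new, G) / t_new
    return -deps_dt * dt

def exact_shifted_density(t_new, dt, G=1.0):
    """Density of the same homogeneous universe at old time t = t_new - dt."""
    return eps_background(t_new - dt, G)

t_new, dt = 1.0, 1e-3
background = eps_background(t_new)
print(exact_shifted_density(t_new, dt) - background, fictitious_perturbation(t_new, dt))
```

Both numbers are nonzero although the universe is exactly homogeneous, and they
agree up to terms of second order in $\delta t$.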

To resolve real and fictitious perturbation modes in General Relativity, it is 
necessary to have a full set of variables. To be precise we need both the matter field 
perturbations and the metric perturbations. 

In this chapter we introduce gauge-invariant variables, which do not depend on 
the particular choice of coordinates and have a clear physical interpretation. We 
apply the formalism developed to study the behavior of relativistic perturbations in 
a few interesting cases. To simplify the formulae we consider only a spatially flat 
universe. The generalization of the results obtained to nonflat universes is largely 
straightforward. 


7.1 Perturbations and gauge-invariant variables 


Inhomogeneities in the matter distribution induce metric perturbations which can 
be decomposed into irreducible pieces. In the linear approximation different types 
of perturbations evolve independently and therefore can be analyzed separately. In 
this section we first classify metric perturbations, then determine how they transform
under general coordinate (gauge) transformations and finally construct the
gauge-invariant variables. The relation between the different coordinate systems
prevalent in the literature is also discussed.


7.1.1 Classification of perturbations 


The metric of a flat Friedmann universe with small perturbations can be written as

$$ds^2 = \left[{}^{(0)}g_{\alpha\beta} + \delta g_{\alpha\beta}(x^\gamma)\right]dx^\alpha dx^\beta, \tag{7.2}$$

where $|\delta g_{\alpha\beta}| \ll |{}^{(0)}g_{\alpha\beta}|$. Using conformal time, the background metric becomes

$${}^{(0)}g_{\alpha\beta}\,dx^\alpha dx^\beta = a^2(\eta)\left(d\eta^2 - \delta_{ij}\,dx^i dx^j\right). \tag{7.3}$$

The metric perturbations $\delta g_{\alpha\beta}$ can be categorized into three distinct types: scalar,
vector and tensor perturbations. This classification is based on the symmetry
properties of the homogeneous, isotropic background, which at a given moment of time
is obviously invariant with respect to the group of spatial rotations and translations.
The $\delta g_{00}$ component behaves as a scalar under these rotations and hence

$$\delta g_{00} = 2a^2\phi, \tag{7.4}$$

where $\phi$ is a 3-scalar.

The spacetime components $\delta g_{0i}$ can be decomposed into the sum of the spatial
gradient of some scalar $B$ and a vector $S_i$ with zero divergence:

$$\delta g_{0i} = a^2\left(B_{,i} + S_i\right). \tag{7.5}$$

Here a comma with index denotes differentiation with respect to the corresponding
spatial coordinate, e.g. $B_{,i} = \partial B/\partial x^i$. The vector $S_i$ satisfies the constraint $S^i_{\;,i} = 0$
and therefore has two independent components. From now on the spatial indices
are always raised and lowered with the unit metric $\delta_{ij}$ and we assume summation
over repeated spatial indices.

In a similar way, the components $\delta g_{ij}$, which behave as a tensor under 3-rotations,
can be written as the sum of the irreducible pieces:

$$\delta g_{ij} = a^2\left(2\psi\delta_{ij} + 2E_{,ij} + F_{i,j} + F_{j,i} + h_{ij}\right). \tag{7.6}$$

Here $\psi$ and $E$ are scalar functions, vector $F_i$ has zero divergence ($F^i_{\;,i} = 0$) and the
3-tensor $h_{ij}$ satisfies the four constraints

$$h^i_i = 0, \qquad h^i_{j,i} = 0, \tag{7.7}$$

that is, it is traceless and transverse. Counting the number of independent functions
used to form $\delta g_{\alpha\beta}$, we find we have four functions for the scalar perturbations, four
functions for the vector perturbations (two 3-vectors with one constraint each), and
two functions for the tensor perturbations (a symmetric 3-tensor has six independent
components and there are four constraints). Thus we have ten functions altogether.
This number coincides with the number of independent components of $\delta g_{\alpha\beta}$.

Scalar perturbations are characterized by the four scalar functions $\phi$, $\psi$, $B$,
$E$. They are induced by energy density inhomogeneities. These perturbations are
most important because they exhibit gravitational instability and may lead to the
formation of structure in the universe.

Vector perturbations are described by the two vectors $S_i$ and $F_i$ and are related to
the rotational motions of the fluid. As in Newtonian theory, they decay very quickly
and are not very interesting from the point of view of cosmology.

Tensor perturbations $h_{ij}$ have no analog in Newtonian theory. They describe
gravitational waves, which are the degrees of freedom of the gravitational field
itself. In the linear approximation the gravitational waves do not induce any
perturbations in the perfect fluid.

Scalar, vector and tensor perturbations are decoupled and thus can be studied 
separately. 


7.1.2 Gauge transformations and gauge-invariant variables 


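Let us consider the coordinate transformation

$$x^\alpha \to \tilde x^\alpha = x^\alpha + \xi^\alpha, \tag{7.8}$$

where $\xi^\alpha$ are infinitesimally small functions of space and time. At a given point
of the spacetime manifold the metric tensor in the coordinate system $\tilde x$ can be
calculated using the usual transformation law

$$\tilde g_{\alpha\beta}(\tilde x^\rho) = \frac{\partial x^\gamma}{\partial\tilde x^\alpha}\frac{\partial x^\delta}{\partial\tilde x^\beta}\,g_{\gamma\delta}(x^\rho) \approx {}^{(0)}g_{\alpha\beta}(x^\rho) + \delta g_{\alpha\beta} - {}^{(0)}g_{\alpha\delta}\,\xi^\delta_{\;,\beta} - {}^{(0)}g_{\gamma\beta}\,\xi^\gamma_{\;,\alpha}, \tag{7.9}$$

where we have kept only the terms linear in $\delta g$ and $\xi$. In the new coordinates $\tilde x$ the
metric can also be split into background and perturbation parts,

$$\tilde g_{\alpha\beta}(\tilde x^\rho) = {}^{(0)}g_{\alpha\beta}(\tilde x^\rho) + \delta\tilde g_{\alpha\beta}, \tag{7.10}$$

where ${}^{(0)}g_{\alpha\beta}$ is the Friedmann metric (7.3), which now depends on $\tilde x$. Comparing
the expressions in (7.9) and (7.10) and taking into account that

$${}^{(0)}g_{\alpha\beta}(x^\rho) \approx {}^{(0)}g_{\alpha\beta}(\tilde x^\rho) - {}^{(0)}g_{\alpha\beta,\gamma}\,\xi^\gamma, \tag{7.11}$$

we infer the following gauge transformation law:

$$\delta g_{\alpha\beta} \to \delta\tilde g_{\alpha\beta} = \delta g_{\alpha\beta} - {}^{(0)}g_{\alpha\beta,\gamma}\,\xi^\gamma - {}^{(0)}g_{\gamma\beta}\,\xi^\gamma_{\;,\alpha} - {}^{(0)}g_{\alpha\delta}\,\xi^\delta_{\;,\beta}. \tag{7.12}$$


Problem 7.1 Consider a 4-scalar $q(x^\rho) = {}^{(0)}q(x^\rho) + \delta q$, where ${}^{(0)}q$ is its
background value, and verify that the perturbation $\delta q$ transforms under (7.8) as

$$\delta q \to \delta\tilde q = \delta q - {}^{(0)}q_{,\alpha}\,\xi^\alpha. \tag{7.13}$$

Similarly, show that for a covariant 4-vector,

$$\delta u_\alpha \to \delta\tilde u_\alpha = \delta u_\alpha - {}^{(0)}u_{\alpha,\gamma}\,\xi^\gamma - {}^{(0)}u_\gamma\,\xi^\gamma_{\;,\alpha}. \tag{7.14}$$

Of course the value of a 4-scalar $q$ at a given point of the manifold does not change
as a result of the coordinate transformation, but the way we split it into a background
value and a perturbation depends on the coordinates used.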


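Let us write the spatial components of the infinitesimal vector $\xi^\alpha \equiv (\xi^0, \xi^i)$ as

$$\xi^i = \xi^i_\perp + \zeta^{,i}, \tag{7.15}$$

where $\xi^i_\perp$ is a 3-vector with zero divergence ($\xi^i_{\perp,i} = 0$) and $\zeta$ is a scalar function.
Since in the Friedmann universe ${}^{(0)}g_{00} = a^2(\eta)$ and ${}^{(0)}g_{ij} = -a^2(\eta)\,\delta_{ij}$, we obtain
from (7.12)

$$\begin{aligned}
\delta\tilde g_{00} &= \delta g_{00} - 2a\left(a\xi^0\right)', \\
\delta\tilde g_{0i} &= \delta g_{0i} + a^2\left[\xi'_{\perp i} + \left(\zeta' - \xi^0\right)_{,i}\right], \\
\delta\tilde g_{ij} &= \delta g_{ij} + a^2\left[2\frac{a'}{a}\delta_{ij}\,\xi^0 + 2\zeta_{,ij} + \left(\xi_{\perp i,j} + \xi_{\perp j,i}\right)\right],
\end{aligned} \tag{7.16}$$

where $\xi_{\perp i} \equiv \xi^i_\perp$ and a prime denotes the derivative with respect to conformal time $\eta$.
Combining these results with (7.4)-(7.6), we immediately derive the transformation
laws for the different types of perturbations.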


Scalar perturbations For scalar perturbations the metric takes the form

$$ds^2 = a^2\left[(1 + 2\phi)\,d\eta^2 + 2B_{,i}\,dx^i d\eta - \left((1 - 2\psi)\,\delta_{ij} - 2E_{,ij}\right)dx^i dx^j\right]. \tag{7.17}$$

Under the change of coordinates we have

$$\begin{aligned}
\phi \to \tilde\phi &= \phi - \frac{1}{a}\left(a\xi^0\right)', &\qquad B \to \tilde B &= B + \zeta' - \xi^0, \\
\psi \to \tilde\psi &= \psi + \frac{a'}{a}\xi^0, &\qquad E \to \tilde E &= E + \zeta.
\end{aligned} \tag{7.18}$$

Thus, only $\xi^0$ and $\zeta$ contribute to the transformations of scalar perturbations and by
choosing them appropriately we can make any two of the four functions $\phi$, $\psi$, $B$, $E$
vanish. The simplest gauge-invariant linear combinations of these functions, which
span the two-dimensional space of the physical perturbations, are

$$\Phi \equiv \phi - \frac{1}{a}\left[a(B - E')\right]', \qquad \Psi \equiv \psi + \frac{a'}{a}(B - E'). \tag{7.19}$$

It is easy to see that they do not change under the coordinate transformations
and if $\Phi$ and $\Psi$ vanish in one particular coordinate system, they will be zero
in any coordinate system. This means we can immediately distinguish physical
inhomogeneities from fictitious perturbations; if both $\Phi$ and $\Psi$ are equal to zero,
then the metric perturbations (if they are present) are fictitious and can be removed
by a change of coordinates.

Of course one can construct an infinite number of gauge-invariant variables,
since any combination of $\Phi$ and $\Psi$ will also be gauge-invariant. Our choice of
these variables is justified only by reason of convenience. As with the electric and
magnetic fields in electrodynamics, the potentials $\Phi$ and $\Psi$ are the simplest possible
combinations and satisfy simple equations of motion (see the following section).
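
The invariance of the two potentials follows in a few lines from the transformation
laws (7.18). The short check below uses only those laws and the definitions (7.19);
the key observation is that the combination $B - E'$ shifts by $-\xi^0$ and does not
involve $\zeta$ at all.

```latex
\begin{aligned}
B - E' &\to (B + \zeta' - \xi^0) - (E + \zeta)' = (B - E') - \xi^0, \\
\Phi &\to \phi - \frac{1}{a}\left(a\xi^0\right)'
        - \frac{1}{a}\left[a(B - E') - a\xi^0\right]'
      = \phi - \frac{1}{a}\left[a(B - E')\right]' = \Phi, \\
\Psi &\to \psi + \frac{a'}{a}\xi^0 + \frac{a'}{a}\left[(B - E') - \xi^0\right]
      = \psi + \frac{a'}{a}(B - E') = \Psi .
\end{aligned}
```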


Problem 7.2 Using the results in Problem 7.1, verify that

$$\overline{\delta\varepsilon} = \delta\varepsilon - \varepsilon_0'\,(B - E') \tag{7.20}$$

is the gauge-invariant variable characterizing the energy density perturbations.
Taking into account that the 4-velocity of a fluid in a homogeneous universe is
${}^{(0)}u_\alpha = (a, 0, 0, 0)$, show that

$$\overline{\delta u}_0 = \delta u_0 - \left[a(B - E')\right]', \qquad \overline{\delta u}_i = \delta u_i - a\,(B - E')_{,i} \tag{7.21}$$

are the gauge-invariant variables for the covariant components of the velocity
perturbations $\delta u_\alpha$.


Vector perturbations For vector perturbations the metric is

$$ds^2 = a^2\left[d\eta^2 + 2S_i\,dx^i d\eta - \left(\delta_{ij} - F_{i,j} - F_{j,i}\right)dx^i dx^j\right], \tag{7.22}$$

and the variables $S_i$ and $F_i$ transform as

$$S_i \to \tilde S_i = S_i + \xi'_{\perp i}, \qquad F_i \to \tilde F_i = F_i + \xi_{\perp i}. \tag{7.23}$$

It is obvious that

$$V_i = S_i - F_i' \tag{7.24}$$

is gauge-invariant. Only two of the four independent functions $S_i$, $F_i$ characterize
physical perturbations; the other two reflect the coordinate freedom. The variables
(7.24) span the two-dimensional space of physical perturbations and describe
rotational motions. The corresponding covariant components of the rotational
velocity $\delta u_{\perp i}$, satisfying the condition $(\delta u_{\perp i})^{,i} = 0$, are also gauge-invariant.


Tensor perturbations For tensor perturbations,

$$ds^2 = a^2\left[d\eta^2 - \left(\delta_{ij} - h_{ij}\right)dx^i dx^j\right] \tag{7.25}$$

and $h_{ij}$ does not change under coordinate transformations. It already describes the
gravitational waves in a gauge-invariant manner.


7.1.3 Coordinate systems 


Gauge freedom has its most important manifestation in scalar perturbations. We 
can use it to impose two conditions on the functions $\phi$, $\psi$, $B$, $E$, $\delta\varepsilon$ and the potential
velocity perturbations $\delta u_{\parallel i} = \varphi_{,i}$. This is possible since we are free to choose the two
functions $\xi^0$ and $\zeta$. Imposing the gauge conditions is equivalent to fixing the (class
of) coordinate system(s). In the following we consider several choices of gauge and 
show how, knowing the solution for the gauge-invariant variables, one can calculate 
the metric and density perturbations in any particular coordinate system in a simple 
way. 


Longitudinal (conformal-Newtonian) gauge Longitudinal gauge is defined by the
conditions $B_l = E_l = 0$. From (7.18), it follows that these conditions fix the
coordinate system uniquely. In fact, the condition $E_l = 0$ is violated by any $\zeta \neq 0$,
and using this result we see that any time transformation with $\xi^0 \neq 0$ destroys
the condition $B_l = 0$. Hence there is no extra coordinate freedom which preserves
$B_l = E_l = 0$. In the corresponding coordinate system the metric takes the form

$$ds^2 = a^2\left[(1 + 2\phi_l)\,d\eta^2 - (1 - 2\psi_l)\,\delta_{ij}\,dx^i dx^j\right]. \tag{7.26}$$

If the spatial part of the energy-momentum tensor is diagonal, that is, $\delta T^i_j \propto \delta^i_j$,
we have $\phi_l = \psi_l$ (see the following section) and there remains only one variable
characterizing scalar metric perturbations. The variable $\phi_l$ is a generalization
of the Newtonian potential, which explains the choice of the name
“conformal-Newtonian” for this coordinate system. As can be seen from (7.19)-(7.21), the
gauge-invariant variables have a very simple physical interpretation: they are the
amplitudes of the metric, density and velocity perturbations in the
conformal-Newtonian coordinate system, in particular, $\Phi = \phi_l$, $\Psi = \psi_l$.


Synchronous gauge Synchronous coordinates, where $\delta g_{0\alpha} = 0$, have been used
most widely in the literature. In our notation, they correspond to the gauge choice
$\phi_s = 0$ and $B_s = 0$. This does not fix the coordinates uniquely; there exists a whole
class of synchronous coordinate systems. From (7.18), it follows that if the
conditions $\phi_s = 0$ and $B_s = 0$ are satisfied in some coordinate system $x^\alpha \equiv (\eta, \mathbf{x})$, then
they will also be satisfied in another coordinate system $\tilde x^\alpha$, related to $x^\alpha$ by

$$\tilde\eta = \eta + \frac{C_1(x^j)}{a}, \qquad \tilde x^i = x^i + C_{1,i}(x^j)\int\frac{d\eta}{a} + C_{2,i}(x^j), \tag{7.27}$$

where $C_1 \equiv C_1(x^j)$ and $C_2 \equiv C_2(x^j)$ are arbitrary functions of the spatial
coordinates. This residual coordinate freedom leads to the appearance of unphysical
gauge modes, which render the interpretation of the results difficult, especially
on scales larger than the Hubble radius.

If we know a solution for perturbations in terms of the gauge-invariant variables
or, equivalently, in the conformal-Newtonian coordinate system, then the behavior
of perturbations in the synchronous coordinate system can easily be determined
without needing to solve the Einstein equations again. Using the definitions in
(7.19) we have

$$\Phi = \frac{1}{a}\left[aE_s'\right]', \qquad \Psi = \psi_s - \frac{a'}{a}E_s'. \tag{7.28}$$

These two equations can easily be resolved to express $\psi_s$ and $E_s$ in terms of the
gauge-invariant potentials:

$$E_s = \int\frac{1}{a}\left(\int a\Phi\,d\eta\right)d\eta, \qquad \psi_s = \Psi + \frac{a'}{a^2}\int a\Phi\,d\eta. \tag{7.29}$$

Similarly, from (7.20) it follows that the energy density perturbations are

$$\delta\varepsilon_s = \overline{\delta\varepsilon} - \frac{\varepsilon_0'}{a}\int a\Phi\,d\eta. \tag{7.30}$$

The constants of integration arising in these formulae correspond to unphysical,
fictitious modes.
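
The inversion leading to these synchronous-gauge expressions can be written out step
by step. The sketch below uses only the definitions (7.19) and (7.20) with
$\phi_s = B_s = 0$; the lower limits of the integrals are left unspecified on purpose.

```latex
\begin{aligned}
\Phi = \frac{1}{a}\left[aE_s'\right]'
  \;&\Rightarrow\; aE_s' = \int a\Phi\,d\eta
  \;\Rightarrow\; E_s = \int\frac{1}{a}\left(\int a\Phi\,d\eta\right)d\eta, \\
\Psi = \psi_s - \frac{a'}{a}E_s'
  \;&\Rightarrow\; \psi_s = \Psi + \frac{a'}{a^2}\int a\Phi\,d\eta, \\
\overline{\delta\varepsilon} = \delta\varepsilon_s + \varepsilon_0'\,E_s'
  \;&\Rightarrow\; \delta\varepsilon_s = \overline{\delta\varepsilon} - \frac{\varepsilon_0'}{a}\int a\Phi\,d\eta .
\end{aligned}
```

Shifting the inner integral by a function of the spatial coordinates adds a term
proportional to $1/a$ to $E_s'$, and shifting the outer one adds a time-independent
term to $E_s$. These match the two arbitrary functions $C_1$ and $C_2$ of the residual
transformation (7.27), which is why the integration constants carry no physical
information.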


Problem 7.3 Impose the comoving gauge conditions

$$\phi = 0, \qquad \delta u^i_\parallel = -\frac{1}{a^2}\,\delta u_{\parallel i} + \frac{1}{a}B_{,i} = 0, \tag{7.31}$$

where $\delta u^i_\parallel$ are the contravariant spatial components of the potential 4-velocity, and
find the metric perturbations in the comoving coordinate system in terms of the
gauge-invariant variables. Do these conditions fix the coordinates uniquely?


7.2 Equations for cosmological perturbations


To derive the equations for the perturbations we have to linearize the Einstein
equations

$$G^\alpha_\beta \equiv R^\alpha_\beta - \frac{1}{2}\delta^\alpha_\beta R = 8\pi G\,T^\alpha_\beta,$$

for small inhomogeneities about a Friedmann universe. The calculation of the
Einstein tensor for the background metric (7.3) is very simple and the result is

$${}^{(0)}G^0_0 = \frac{3\mathcal{H}^2}{a^2}, \qquad {}^{(0)}G^0_i = 0, \qquad {}^{(0)}G^i_j = \frac{1}{a^2}\left(2\mathcal{H}' + \mathcal{H}^2\right)\delta_{ij}, \tag{7.32}$$

where $\mathcal{H} \equiv a'/a$. It is clear that in order to satisfy the background Einstein
equations, the energy-momentum tensor for the matter, ${}^{(0)}T^\alpha_\beta$, must have the following
symmetry properties:

$${}^{(0)}T^0_i = 0, \qquad {}^{(0)}T^i_j \propto \delta^i_j. \tag{7.33}$$

For a metric with small perturbations, the Einstein tensor can be written as
$G^\alpha_\beta = {}^{(0)}G^\alpha_\beta + \delta G^\alpha_\beta + \cdots$, where $\delta G^\alpha_\beta$ denote terms linear in metric fluctuations.
The energy-momentum tensor can be split in a similar way and the linearized
equations for perturbations are

$$\delta G^\alpha_\beta = 8\pi G\,\delta T^\alpha_\beta. \tag{7.34}$$

Neither $\delta G^\alpha_\beta$ nor $\delta T^\alpha_\beta$ is gauge-invariant. Combining them with the metric
perturbations, however, we can construct corresponding gauge-invariant quantities.

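Problem 7.4 Derive the transformation laws for $\delta T^\alpha_\beta$ and verify that

$$\begin{aligned}
\overline{\delta T}^0_0 &= \delta T^0_0 - \left({}^{(0)}T^0_0\right)'(B - E'), \\
\overline{\delta T}^0_i &= \delta T^0_i - \left({}^{(0)}T^0_0 - {}^{(0)}T^k_k/3\right)(B - E')_{,i}, \\
\overline{\delta T}^i_j &= \delta T^i_j - \left({}^{(0)}T^i_j\right)'(B - E'),
\end{aligned} \tag{7.35}$$

where $T^k_k$ is the trace of the spatial components, are gauge-invariant.

In a similar manner to (7.35), we can construct

$$\overline{\delta G}^0_0 = \delta G^0_0 - \left({}^{(0)}G^0_0\right)'(B - E'), \quad \text{etc.} \tag{7.36}$$

and rewrite (7.34) in the form

$$\overline{\delta G}^\alpha_\beta = 8\pi G\,\overline{\delta T}^\alpha_\beta. \tag{7.37}$$

The components of $\overline{\delta T}^\alpha_\beta$ can also be decomposed into scalar, vector and tensor
pieces; each piece contributes only to the evolution of the corresponding
perturbation.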


Scalar perturbations The left hand side in (7.37) is gauge-invariant and depends
only on the metric perturbations. Therefore it can be expressed entirely in terms of
the potentials $\Phi$ and $\Psi$. The direct calculation of $\overline{\delta G}^\alpha_\beta$ for the metric (7.17) gives
the equations:

$$\Delta\Psi - 3\mathcal{H}\left(\Psi' + \mathcal{H}\Phi\right) = 4\pi G a^2\,\overline{\delta T}^0_0, \tag{7.38}$$

$$\left(\Psi' + \mathcal{H}\Phi\right)_{,i} = 4\pi G a^2\,\overline{\delta T}^0_i, \tag{7.39}$$

$$\left[\Psi'' + \mathcal{H}(2\Psi + \Phi)' + \left(2\mathcal{H}' + \mathcal{H}^2\right)\Phi + \frac{1}{2}\Delta(\Phi - \Psi)\right]\delta_{ij} - \frac{1}{2}(\Phi - \Psi)_{,ij} = -4\pi G a^2\,\overline{\delta T}^i_j. \tag{7.40}$$

We have to stress that these equations can be derived without imposing any gauge
conditions and they are valid in an arbitrary coordinate system. To obtain the explicit
form of the equations for the metric perturbations in a particular coordinate system,
we simply have to express $\Phi$ and $\Psi$ in (7.38)-(7.40) through these perturbations.
For example, in the synchronous coordinate system, we would use the expressions
in (7.28).


Problem 7.5 Write down the equations for $\psi_s$ and $E_s$ in the synchronous coordinate
system. (Hint: Do not forget that $E_s$ enters the definition of $\overline{\delta T}^\alpha_\beta$.)


The equations for the metric perturbations in the conformal-Newtonian coordinate
system obviously coincide with (7.38)-(7.40). Therefore calculations in these
coordinates are identical to calculations in terms of the gauge-invariant potentials,
with the advantage that we need not carry $B$ and $E$ through the intermediate
formulae.


Problem 7.6 Derive (7.38)-(7.40). (Hint: The direct calculation of $\overline{\delta G}^\alpha_\beta$ in terms
of the gauge-invariant potentials is rather tedious. However, it can be significantly
simplified if we take into account that these potentials coincide with the metric
perturbations in the conformal-Newtonian coordinate system. Therefore, calculate
the Einstein tensor for the metric (7.26) and then replace $\phi_l$, $\psi_l$ with $\Phi$ and $\Psi$
respectively. It is convenient to calculate $\delta G^\alpha_\beta$ in two steps: (a) set $a = 1$ in (7.26)
and find the Einstein tensor in this case, (b) make a conformal transformation to an
arbitrary $a(\eta)$ and calculate $\delta G^\alpha_\beta$ using formulae (5.110), (5.111), where $F = a^2$.)


Vector perturbations The equations for the vector perturbations take the forms

$$\Delta V_i = 16\pi G a^2\,\overline{\delta T}^0_{i(V)}, \tag{7.41}$$

$$\left(V_{i,j} + V_{j,i}\right)' + 2\mathcal{H}\left(V_{i,j} + V_{j,i}\right) = -16\pi G a^2\,\overline{\delta T}^i_{j(V)}, \tag{7.42}$$

where $V_i$ is defined in (7.24) and $\overline{\delta T}^\alpha_{\beta(V)}$ is the vector part of the energy-momentum
tensor.


Tensor perturbations For the gravitational waves we obtain

$$h_{ij}'' + 2\mathcal{H}h_{ij}' - \Delta h_{ij} = 16\pi G a^2\,\overline{\delta T}^i_{j(T)}, \tag{7.43}$$

where $\overline{\delta T}^i_{j(T)}$ is that part of the energy-momentum tensor which has the same
structural form as $h_{ij}$.


Problem 7.7 Derive (7.41), (7.42) and (7.43). 


7.3 Hydrodynamical perturbations 


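Let us consider a perfect fluid with energy-momentum tensor

$$T^\alpha_\beta = (\varepsilon + p)\,u^\alpha u_\beta - p\,\delta^\alpha_\beta. \tag{7.44}$$

One can easily verify that its gauge-invariant perturbations, defined in (7.35), can
be written as

$$\overline{\delta T}^0_0 = \overline{\delta\varepsilon}, \qquad \overline{\delta T}^0_i = \frac{1}{a}(\varepsilon_0 + p_0)\left(\overline{\delta u}_{\parallel i} + \delta u_{\perp i}\right), \qquad \overline{\delta T}^i_j = -\overline{\delta p}\,\delta^i_j, \tag{7.45}$$

where $\overline{\delta\varepsilon}$, $\overline{\delta u}_{\parallel i}$ and $\overline{\delta p}$ are defined in (7.20), (7.21). The only term which contributes
to the vector perturbations is proportional to $\delta u_{\perp i}$; all other terms have the same
structural form as the scalar metric perturbations.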


7.3.1 Scalar perturbations


Since $\overline{\delta T}^i_j = 0$ for $i \neq j$, (7.40) reduces to

$$(\Phi - \Psi)_{,ij} = 0 \qquad (i \neq j). \tag{7.46}$$

The only solution consistent with $\Phi$ and $\Psi$ being perturbations is $\Psi = \Phi$. Then
substituting (7.45) into (7.38)-(7.40) we arrive at the following set of equations for
the scalar perturbations:

$$\Delta\Phi - 3\mathcal{H}\left(\Phi' + \mathcal{H}\Phi\right) = 4\pi G a^2\,\overline{\delta\varepsilon}, \tag{7.47}$$

$$(a\Phi)'_{,i} = 4\pi G a^2(\varepsilon_0 + p_0)\,\overline{\delta u}_{\parallel i}, \tag{7.48}$$

$$\Phi'' + 3\mathcal{H}\Phi' + \left(2\mathcal{H}' + \mathcal{H}^2\right)\Phi = 4\pi G a^2\,\overline{\delta p}. \tag{7.49}$$

In a nonexpanding universe $\mathcal{H} = 0$, and the first equation exactly coincides with the
usual Poisson equation for the gravitational potential. In an expanding universe, the
second and third terms on the left hand side in (7.47) are suppressed on sub-Hubble
scales by a factor $\sim \lambda/H^{-1}$, and hence can be neglected. Thus (7.47) generalizes
the Poisson equation and supports the interpretation of $\Phi$ as the relativistic
generalization of the Newtonian gravitational potential. Note that, on scales smaller
than the Hubble radius, (7.47) can be applied even to nonlinear inhomogeneities,
because it requires only $|\Phi| \ll 1$ but not necessarily $|\overline{\delta\varepsilon}/\varepsilon_0| \ll 1$. From (7.48) it
follows that the time derivative of $a\Phi$, that is $(a\Phi)'$, serves as the velocity potential.

Given $p(\varepsilon, S)$, the pressure fluctuations $\overline{\delta p}$ can be expressed in terms of the
energy density and entropy perturbations,

$$\overline{\delta p} = c_s^2\,\overline{\delta\varepsilon} + \tau\,\delta S, \tag{7.50}$$

where $c_s^2 \equiv (\partial p/\partial\varepsilon)_S$ is the square of the speed of sound and $\tau \equiv (\partial p/\partial S)_\varepsilon$. Taking
this into account and combining (7.47) and (7.49), we obtain the closed form
equation for the gravitational potential

$$\Phi'' + 3\left(1 + c_s^2\right)\mathcal{H}\Phi' - c_s^2\Delta\Phi + \left(2\mathcal{H}' + \left(1 + 3c_s^2\right)\mathcal{H}^2\right)\Phi = 4\pi G a^2\tau\,\delta S. \tag{7.51}$$

Below we begin by finding the exact solutions of this equation in two particular
cases: (a) for nonrelativistic matter with zero pressure, and (b) for relativistic fluid
with constant equation of state $p = w\varepsilon$. Then we analyze the behavior of adiabatic
perturbations ($\delta S = 0$) for a general equation of state $p(\varepsilon)$, and finally consider
entropy perturbations.
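
The step from (7.47), (7.49) and (7.50) to the closed equation (7.51) is short enough
to display. In the sketch below the pressure perturbation in (7.49) is replaced using
(7.50), and the density perturbation is then eliminated with (7.47).

```latex
\begin{aligned}
\Phi'' + 3\mathcal{H}\Phi' + \left(2\mathcal{H}' + \mathcal{H}^2\right)\Phi
  &= 4\pi G a^2\left(c_s^2\,\overline{\delta\varepsilon} + \tau\,\delta S\right) \\
  &= c_s^2\left[\Delta\Phi - 3\mathcal{H}\left(\Phi' + \mathcal{H}\Phi\right)\right]
     + 4\pi G a^2\tau\,\delta S, \\
\Rightarrow\quad
\Phi'' + 3\left(1 + c_s^2\right)\mathcal{H}\Phi' - c_s^2\Delta\Phi
  + \left(2\mathcal{H}' + \left(1 + 3c_s^2\right)\mathcal{H}^2\right)\Phi
  &= 4\pi G a^2\tau\,\delta S .
\end{aligned}
```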


Nonrelativistic matter ($p = 0$)  In a flat matter-dominated universe $a \propto \eta^2$ and
$\mathcal{H} = 2/\eta$. In this case (7.51) simplifies to

$$\Phi'' + \frac{6}{\eta}\Phi' = 0, \tag{7.52}$$

and has the solution

$$\Phi = C_1(\mathbf{x}) + \frac{C_2(\mathbf{x})}{\eta^5}, \tag{7.53}$$

where $C_1(\mathbf{x})$ and $C_2(\mathbf{x})$ are arbitrary functions of the comoving spatial coordinates.
From (7.47) we find the gauge-invariant density perturbations

$$\frac{\overline{\delta\varepsilon}}{\varepsilon_0} = \frac{1}{6}\left[\left(\Delta C_1\eta^2 - 12C_1\right) + \left(\Delta C_2\eta^2 + 18C_2\right)\frac{1}{\eta^5}\right]. \tag{7.54}$$

The nondecaying mode of the gravitational potential, $C_1$, remains constant regardless
of the size of the inhomogeneity relative to the Hubble radius. The behavior of the
energy density perturbations, however, depends crucially on the scale.

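Let us consider a plane wave perturbation with comoving wavenumber $k \equiv |\mathbf{k}|$,
for which $C_{1,2} \propto \exp(i\mathbf{k}\mathbf{x})$. If the physical scale $\lambda_{ph} \sim a/k$ is much larger than the
Hubble scale $H^{-1} \sim a\eta$ or, equivalently, $k\eta \ll 1$, then to leading order

$$\frac{\overline{\delta\varepsilon}}{\varepsilon_0} \simeq -2C_1 + \frac{3C_2}{\eta^5}. \tag{7.55}$$

Neglecting the decaying mode, the relation between the energy density fluctuations
and the gravitational potential on superhorizon scales becomes $\overline{\delta\varepsilon}/\varepsilon_0 \simeq -2\Phi$.
For shortwave perturbations with $k\eta \gg 1$, we have

$$\frac{\overline{\delta\varepsilon}}{\varepsilon_0} \simeq -\frac{k^2}{6}\left(C_1\eta^2 + C_2\eta^{-3}\right) = \tilde{C}_1 t^{2/3} + \tilde{C}_2 t^{-1}, \tag{7.56}$$

in agreement with the Newtonian result (6.56).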
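The two limits (7.55) and (7.56) can be checked numerically. The following is a minimal sketch, assuming a single plane wave so that the Laplacian in (7.54) acts as $-k^2$; the function names and the sample values of $k$, $\eta$ and $C_1$ are illustrative choices, not part of the text.

```python
def potential_matter(eta, c1, c2):
    """Gravitational potential (7.53) for a single plane-wave mode."""
    return c1 + c2 / eta ** 5


def density_contrast_matter(k, eta, c1, c2):
    """Density contrast (7.54) with the Laplacian replaced by -k**2."""
    nondecaying = -(k * eta) ** 2 * c1 - 12.0 * c1
    decaying = (-(k * eta) ** 2 * c2 + 18.0 * c2) / eta ** 5
    return (nondecaying + decaying) / 6.0


# Illustrative amplitude; the decaying mode is switched off (c2 = 0).
c1 = 1.0e-5

# Superhorizon mode, k*eta << 1: the contrast should approach -2*Phi.
long_wave = density_contrast_matter(k=1.0e-3, eta=1.0, c1=c1, c2=0.0)
long_ratio = long_wave / potential_matter(1.0, c1, 0.0)

# Subhorizon mode, k*eta >> 1: doubling eta should roughly quadruple the contrast.
short_a = density_contrast_matter(k=100.0, eta=1.0, c1=c1, c2=0.0)
short_b = density_contrast_matter(k=100.0, eta=2.0, c1=c1, c2=0.0)
short_ratio = short_b / short_a

print(long_ratio, short_ratio)
```

The first ratio is close to $-2$ and the second is close to $4$, reflecting the growth $\propto \eta^2 \propto t^{2/3}$ while the potential itself stays constant.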


Problem 7.8 Determine the behavior of the peculiar velocity for nonrelativistic 
matter. 


Problem 7.9 Substituting (7.53) and (7.54) in (7.29) and (7.30), calculate the metric
and energy density perturbations in the synchronous coordinate system. Analyze
the behavior of the long- and short-wavelength perturbations. Is the Newtonian limit
explicit in this coordinate system?


Ultra-relativistic matter  Let us now study the behavior of adiabatic perturbations
($\delta S = 0$) in the universe dominated by relativistic matter with equation of state
$p = w\varepsilon$, where $w$ is a positive constant. In this case the scale factor increases as
$a \propto \eta^{2/(1+3w)}$ (see Problem 1.18). With $c_s^2 = w$, and for a plane wave perturbation
$\Phi = \Phi_k(\eta)\exp(i\mathbf{k}\mathbf{x})$, (7.51) becomes

$$\Phi_k'' + \frac{6(1 + w)}{1 + 3w}\frac{1}{\eta}\Phi_k' + wk^2\Phi_k = 0. \tag{7.57}$$

The solution of this equation is

$$\Phi_k = \eta^{-\nu}\left[C_1 J_\nu(\sqrt{w}k\eta) + C_2 Y_\nu(\sqrt{w}k\eta)\right], \qquad \nu \equiv \frac{1}{2}\left(\frac{5 + 3w}{1 + 3w}\right), \tag{7.58}$$

where $J_\nu$ and $Y_\nu$ are Bessel functions of order $\nu$.

Considering long-wavelength inhomogeneities, for which $\sqrt{w}k\eta \ll 1$, and using
the small-argument expansion of the Bessel functions, we see that in this limit
the nondecaying mode of $\Phi$ is constant. It follows from (7.47) that

$$\overline{\delta\varepsilon}/\varepsilon_0 \simeq -2\Phi. \tag{7.59}$$

Perturbations on scales smaller than the Jeans length $\lambda_J \sim c_s t$, for which
$\sqrt{w}k\eta \gg 1$, behave as sound waves with decaying amplitude

$$\Phi_k \propto \eta^{-\nu - 1/2}\exp(\pm i\sqrt{w}k\eta). \tag{7.60}$$

In a radiation-dominated universe ($w = 1/3$), the order of the Bessel functions
is $\nu = 3/2$ and they can be expressed in terms of elementary functions. We have

$$\Phi_k = \frac{1}{x^2}\left[C_1\left(\frac{\sin x}{x} - \cos x\right) + C_2\left(\frac{\cos x}{x} + \sin x\right)\right], \tag{7.61}$$

where $x \equiv k\eta/\sqrt{3}$. The corresponding energy density perturbations are

$$\frac{\overline{\delta\varepsilon}}{\varepsilon_0} = 2C_1\left[\left(\frac{2 - x^2}{x^2}\right)\left(\frac{\sin x}{x} - \cos x\right) - \frac{\sin x}{x}\right] + 2C_2\left[\left(\frac{2 - x^2}{x^2}\right)\left(\frac{\cos x}{x} + \sin x\right) - \frac{\cos x}{x}\right]. \tag{7.62}$$
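As a quick consistency check of (7.59), (7.61) and (7.62), one can evaluate the nondecaying ($C_1$) parts numerically. This is only a sketch: the function names are ours, and the sample arguments $x = 0.01$ and $x = 200$ are arbitrary representatives of the long- and short-wavelength regimes.

```python
import math


def phi_radiation(x, c1=1.0):
    """C1 part of the potential (7.61), with x = k*eta/sqrt(3)."""
    return c1 * (math.sin(x) / x - math.cos(x)) / x ** 2


def density_contrast_radiation(x, c1=1.0):
    """C1 part of the density contrast (7.62)."""
    bracket = ((2.0 - x ** 2) / x ** 2) * (math.sin(x) / x - math.cos(x))
    bracket -= math.sin(x) / x
    return 2.0 * c1 * bracket


# Long-wavelength limit: Phi tends to C1/3 and the contrast to -2*Phi.
x_small = 0.01
phi_long = phi_radiation(x_small)
ratio_long = density_contrast_radiation(x_small) / phi_long

# Short-wavelength regime: x**2 * Phi stays of order unity, so Phi decays as 1/x**2,
# while the density contrast keeps oscillating with an amplitude of order 2*C1.
x_large = 200.0
scaled_phi = abs(phi_radiation(x_large)) * x_large ** 2
contrast_large = abs(density_contrast_radiation(x_large))

print(phi_long, ratio_long, scaled_phi, contrast_large)
```

The long-wavelength ratio reproduces $\overline{\delta\varepsilon}/\varepsilon_0 \simeq -2\Phi$, and the short-wavelength values illustrate sound waves whose potential decays as $\eta^{-2}$, in line with (7.60) for $\nu = 3/2$.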











General case  Unfortunately, (7.51) cannot be solved exactly for an arbitrary equation
of state $p(\varepsilon)$. However, it turns out to be possible to derive asymptotic solutions
for both long-wavelength and short-wavelength perturbations. To do this, it
is convenient to recast the equation in a slightly different form. The "friction term"
proportional to $\Phi'$ can be eliminated if we introduce the new variable

$$u \equiv \exp\left(\frac{3}{2}\int(1 + c_s^2)\mathcal{H}\,d\eta\right)\Phi = \exp\left(-\frac{1}{2}\int\left(1 + \frac{p_0'}{\varepsilon_0'}\right)\frac{\varepsilon_0'}{\varepsilon_0 + p_0}\,d\eta\right)\Phi = \frac{\Phi}{(\varepsilon_0 + p_0)^{1/2}}, \tag{7.63}$$

where we have used $c_s^2 = p_0'/\varepsilon_0'$ and expressed $\mathcal{H}$ in terms of $\varepsilon$ and $p$ via the
conservation law $\varepsilon' = -3\mathcal{H}(\varepsilon + p)$. After some tedious calculations, and using
the background equations (see (1.67), (1.68))

$$\mathcal{H}^2 = \frac{8\pi G}{3}a^2\varepsilon_0, \qquad \mathcal{H}^2 - \mathcal{H}' = 4\pi G a^2(\varepsilon_0 + p_0), \tag{7.64}$$

the equation for $u$ can be written in the form

$$u'' - c_s^2\Delta u - \frac{\theta''}{\theta}u = 0, \tag{7.65}$$

where

$$\theta \equiv \frac{1}{a}\left(1 + \frac{p_0}{\varepsilon_0}\right)^{-1/2} = \frac{1}{a}\left(\frac{2}{3}\left(1 - \frac{\mathcal{H}'}{\mathcal{H}^2}\right)\right)^{-1/2}. \tag{7.66}$$

For a plane wave perturbation, $u \propto \exp(i\mathbf{k}\mathbf{x})$, the solutions of (7.65) can easily be
found in two limiting cases: on scales much larger and scales much smaller than
the Jeans length.

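Long-wavelength perturbations  When $c_s k\eta \ll 1$, we can omit the spatial derivative
term in (7.65) and then $u \propto \theta$ is obviously the solution of this equation. The
second solution is derived using the Wronskian,

$$u \simeq C_1\theta + C_2\theta\int_{\eta_0}\frac{d\eta}{\theta^2} = C_2\theta\int_{\bar{\eta}_0}\frac{d\eta}{\theta^2}, \tag{7.67}$$

where the latter equality is obtained by changing the lower limit of integration
from $\eta_0$ to $\bar{\eta}_0$ and, thus, absorbing the $C_1$ mode. Using the definition in (7.66) and
integrating by parts, we reduce the integral in (7.67) to

$$\int\frac{d\eta}{\theta^2} = \frac{2}{3}\int a^2\left[1 + \left(\frac{1}{\mathcal{H}}\right)'\right]d\eta = \frac{2}{3}\left(\frac{a^2}{\mathcal{H}} - \int a^2\,d\eta\right). \tag{7.68}$$

With this result the gravitational potential becomes

$$\Phi = (\varepsilon_0 + p_0)^{1/2}u = A\left(1 - \frac{\mathcal{H}}{a^2}\int a^2\,d\eta\right) = A\frac{d}{dt}\left(\frac{1}{a}\int a\,dt\right), \tag{7.69}$$

where $t = \int a\,d\eta$.

Let us apply this result to find the behavior of long-wavelength, adiabatic
perturbations ($\delta S = 0$) in a universe with a mixture of radiation and cold baryons. In
this case the scale factor increases as

$$a(\eta) = a_{eq}(\xi^2 + 2\xi), \tag{7.70}$$

where $\xi \equiv \eta/\eta_*$ (see (1.81)). Substituting (7.70) into (7.69), we obtain

$$\Phi = \frac{\xi + 1}{(\xi + 2)^3}\left[A\left(\frac{3}{5}\xi^2 + 3\xi + \frac{1}{\xi + 1} + \frac{13}{3}\right) + B\frac{1}{\xi^3}\right], \tag{7.71}$$

where $A$ and $B$ are constants of integration multiplying the nondecaying and
decaying modes respectively.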





Problem 7.10 Calculate the energy density fluctuations. 


The behavior of the gravitational potential and the energy density perturbation
for an inhomogeneity which enters the Hubble horizon after equality, is shown in
Figure 7.2, where we have neglected the decaying mode. We see that $\Phi$ and $\overline{\delta\varepsilon}/\varepsilon_0$
are constant at times both early and late compared to $\eta_{eq} \sim \eta_*$. After the transition
from the radiation- to the matter-dominated epoch, the amplitude of both $\Phi$ and
$\overline{\delta\varepsilon}/\varepsilon_0$ decreases by a factor of 9/10. During the period of matter domination, $\Phi$
remains constant while $\overline{\delta\varepsilon}/\varepsilon_0$ begins to increase after the perturbation enters horizon
at $\eta \sim k^{-1}$. It follows from (7.47) that for a constant potential $\Phi$, the amplitude of
$\overline{\delta\varepsilon}/\varepsilon_0$ is always equal to $-2\Phi$ on superhorizon scales.

Fig. 7.2.

The change in the amplitude of $\Phi$ can also be inferred from a widely used
"conservation law" for the quantity

$$\zeta \equiv \frac{2}{3}\left(\frac{8\pi G}{3}\right)^{-1/2}\theta^2\left(\frac{u}{\theta}\right)'. \tag{7.72}$$

Substituting the long-wavelength solution (7.67) into (7.72), we see that $\zeta$ remains
constant (is conserved) even if $w \equiv p/\varepsilon$ is changing. Recalling the definitions of
$u$, $\theta$ and using the background equations of motion, $\zeta$ becomes

$$\zeta = \frac{2}{3}\frac{\mathcal{H}^{-1}\Phi' + \Phi}{1 + w} + \Phi. \tag{7.73}$$

Let us assume the equation of state $w$ is initially a constant $w_i$, and later on changes
to another constant $w_f$. In this case the initial and final values of $\Phi$ are also constants
and it immediately follows from (7.73) that

$$\Phi_f = \left(\frac{1 + w_f}{1 + w_i}\right)\left(\frac{5 + 3w_i}{5 + 3w_f}\right)\Phi_i. \tag{7.74}$$

For a matter-radiation universe, $w_i = 1/3$ and $w_f = 0$, and we obtain the familiar
result $\Phi_f = (9/10)\Phi_i$.
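The factor 9/10 can be reproduced in two independent ways: from the limits of the exact long-wavelength solution (7.71) and from the conservation law in the form (7.74). The sketch below assumes the decaying mode is absent ($B = 0$); the function names and the sample values of $\xi$ are illustrative.

```python
def phi_adiabatic(xi, amplitude=1.0):
    """Nondecaying mode of (7.71) with B = 0; xi = eta / eta_star."""
    bracket = 0.6 * xi ** 2 + 3.0 * xi + 1.0 / (xi + 1.0) + 13.0 / 3.0
    return amplitude * (xi + 1.0) / (xi + 2.0) ** 3 * bracket


def potential_ratio(w_initial, w_final):
    """Phi_f / Phi_i from (7.74) for constant initial and final w."""
    return ((1.0 + w_final) / (1.0 + w_initial)
            * (5.0 + 3.0 * w_initial) / (5.0 + 3.0 * w_final))


early = phi_adiabatic(1.0e-6)   # deep in the radiation-dominated epoch
late = phi_adiabatic(1.0e6)     # deep in the matter-dominated epoch

ratio_from_solution = late / early
ratio_from_zeta = potential_ratio(1.0 / 3.0, 0.0)

print(early, late, ratio_from_solution, ratio_from_zeta)
```

Both ratios agree with 9/10, with $\Phi \to 2A/3$ at early times and $\Phi \to 3A/5$ at late times.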





Problem 7.11 Verify that for a mode with wavenumber $k$, equation (7.65) can be
rewritten in the following integral form:

$$u_k(\eta) = C_1\theta + C_2\theta\int^{\eta}\frac{d\tilde{\eta}}{\theta^2} - k^2\theta\int^{\eta}\left(\int^{\tilde{\eta}}c_s^2\theta u_k\,d\bar{\eta}\right)\frac{d\tilde{\eta}}{\theta^2(\tilde{\eta})}. \tag{7.75}$$

Using this equation, calculate the subleading $k^2$-corrections to the long-wavelength
solution (7.67) and determine the violation of the "conserved" quantity $\zeta$.


Short-wavelength perturbations  When $c_s k\eta \gg 1$, the last term in (7.65) can be
neglected. The resulting equation,

$$u'' + c_s^2k^2u \simeq 0, \tag{7.76}$$

can easily be solved in the WKB approximation for a slowly varying speed of
sound. Its solution describes sound waves with a time-dependent amplitude.


Matching conditions  Sometimes it is convenient to approximate the continuous
change of the equation of state by a sharp jump. In this case the pressure $p(\varepsilon)$ is
discontinuous on the hypersurface of transition $\Sigma$, $\varepsilon_T = \text{const}$, and its derivatives
become singular. Therefore we cannot directly use the equation for the gravitational
potential and, instead, must derive matching conditions for $\Phi$ and $\Phi'$ on $\Sigma$. These
conditions can be obtained if we recast (7.65) in the following form:

$$\left(\theta^2\left(\frac{u}{\theta}\right)'\right)' = c_s^2\theta^2\Delta\left(\frac{u}{\theta}\right). \tag{7.77}$$

Evidently $u/\theta$ should be continuous. Because the scale factor $a$ and the energy
density $\varepsilon$ are both continuous, the gravitational potential $\Phi$ does not jump during
the transition, or equivalently, the 3-metric induced on $\Sigma$ is continuous.

To determine the jump in the derivative of $u/\theta$ let us integrate (7.77) within an
infinitesimally thin layer near $\Sigma$:

$$\left[\theta^2\left(\frac{u}{\theta}\right)'\right]_\pm = \int_{\Sigma - 0}^{\Sigma + 0}c_s^2\theta^2\Delta\left(\frac{u}{\theta}\right)d\eta, \tag{7.78}$$

where $[X]_\pm \equiv X_+ - X_-$ denotes the jump of a variable $X$ on $\Sigma$. The integrand in
(7.78) is singular and to perform the integration we note that

$$c_s^2\theta^2 = \left(\frac{p_0'}{\varepsilon_0'}\right)\left(\frac{1}{a^2}\frac{\varepsilon_0}{\varepsilon_0 + p_0}\right) = \frac{1}{3a^2\mathcal{H}}\left(\frac{\varepsilon_0}{\varepsilon_0 + p_0}\right)' + \frac{p_0}{a^2(\varepsilon_0 + p_0)}.$$

The second term in the latter equality does not contribute to the integral, while the
first term gives

$$\int_{\Sigma - 0}^{\Sigma + 0}c_s^2\theta^2\Delta\left(\frac{u}{\theta}\right)d\eta = \left[\frac{1}{3a^2\mathcal{H}}\left(\frac{\varepsilon_0}{\varepsilon_0 + p_0}\right)\right]_\pm\Delta\left(\frac{u}{\theta}\right), \tag{7.79}$$

taking into account that $a$, $\varepsilon$ and $u/\theta$ are continuous. Substituting (7.79) into (7.78)
and expressing $u$ in terms of the gravitational potential, we finally arrive at the
following matching conditions:

$$[\Phi]_\pm = 0, \qquad \left[\zeta - \frac{2}{9\mathcal{H}^2}\frac{\Delta\Phi}{1 + w}\right]_\pm = 0, \tag{7.80}$$

where $\zeta$ is defined in (7.72), (7.73). For long-wavelength perturbations the term
proportional to $\Delta\Phi$ can be neglected. Hence on superhorizon scales the matching
conditions reduce to the continuity of $\Phi$ and to the conservation law for $\zeta$.


Problem 7.12 Assuming a sharp transition from the radiation- to the matter-dom- 
inated epoch, determine the amplitude of metric perturbations after the transition 
for both short- and long-wavelength perturbations. 


Problem 7.13 Write down the matching conditions explicitly in terms of the metric 
perturbations in the synchronous coordinate system. 


Entropy perturbations  Until now we have been considering adiabatic perturbations
in an isentropic fluid where the pressure depends only on the energy density.
In a multi-component medium both adiabatic and entropy perturbations can arise.
Generally speaking, the analysis of perturbations in this case is rather complicated
because of extra instability modes due to the relative motion of the components. We
will consider the problem of cold matter mixed with a baryon-radiation plasma in
Section 7.4. Here we content ourselves with studying a fluid of cold baryons tightly
coupled to radiation. The baryons do not move with respect to the radiation and this
simplifies our task enormously. In particular, we can still use the one-component
perfect fluid approximation. However, in this case the pressure depends not only on
the energy density, but also on the baryon-to-radiation distribution, characterized
by the entropy per baryon, $S \sim T^3/n_b$, where $n_b$ is the number density of baryons.
Consequently, entropy perturbations can arise. We can study the evolution of these
perturbations with the help of (7.51). Here $\delta S$ is constant because the entropy per
baryon is conserved.

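First we need to determine the parameter $\tau \equiv (\partial p/\partial S)_\varepsilon$ entering (7.51). The
cold baryons do not contribute to the pressure and hence the fluctuations of the
total pressure are entirely due to the radiation:

$$\overline{\delta p} = \overline{\delta p}_\gamma = \frac{1}{3}\overline{\delta\varepsilon}_\gamma. \tag{7.81}$$

In turn, $\overline{\delta\varepsilon}_\gamma$ can be expressed in terms of the total energy density perturbation

$$\overline{\delta\varepsilon} = \overline{\delta\varepsilon}_\gamma + \overline{\delta\varepsilon}_b \tag{7.82}$$

and the entropy fluctuation $\delta S$. Because the energy density of the radiation, $\varepsilon_\gamma$, is
proportional to $T^4$ and $\varepsilon_b \propto n_b$, we have $S \propto \varepsilon_\gamma^{3/4}/\varepsilon_b$ and therefore

$$\frac{\delta S}{S} = \frac{3}{4}\frac{\overline{\delta\varepsilon}_\gamma}{\varepsilon_\gamma} - \frac{\overline{\delta\varepsilon}_b}{\varepsilon_b}. \tag{7.83}$$

Solving (7.82) and (7.83) for $\overline{\delta\varepsilon}_\gamma$ in terms of $\overline{\delta\varepsilon}$ and $\delta S$, and substituting the result
into (7.81) we obtain

$$\overline{\delta p} = \frac{1}{3}\left(1 + \frac{3}{4}\frac{\varepsilon_b}{\varepsilon_\gamma}\right)^{-1}\overline{\delta\varepsilon} + \frac{1}{3}\varepsilon_b\left(1 + \frac{3}{4}\frac{\varepsilon_b}{\varepsilon_\gamma}\right)^{-1}\frac{\delta S}{S}. \tag{7.84}$$

Comparing (7.84) with (7.50), we read off the speed of sound $c_s$ and $\tau$:

$$c_s^2 = \frac{1}{3}\left(1 + \frac{3}{4}\frac{\varepsilon_b}{\varepsilon_\gamma}\right)^{-1}, \qquad \tau = \frac{c_s^2\varepsilon_b}{S}. \tag{7.85}$$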

For $\delta S \neq 0$, the general solution of (7.51) is the sum of a particular solution and
a general solution of the homogeneous equation ($\delta S = 0$). To find a particular
solution, we note that

$$2\mathcal{H}' + (1 + 3c_s^2)\mathcal{H}^2 = 8\pi G a^2(c_s^2\varepsilon - p) = 2\pi G a^2c_s^2\varepsilon_b.$$

Substituting this expression and $\tau$ from (7.85) into (7.51), we immediately see that
for the long-wavelength perturbations, for which the $\Delta\Phi$ term can be neglected,

$$\Phi = 2\frac{\delta S}{S} = \text{const} \tag{7.86}$$

is a particular solution of this equation.
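The algebra leading to (7.84)-(7.86) is easy to verify numerically. The sketch below uses made-up background densities and perturbation amplitudes (all numerical values and function names are hypothetical, chosen only for illustration) and checks that (7.84) reproduces $\overline{\delta p} = \overline{\delta\varepsilon}_\gamma/3$, and that $8\pi G a^2(c_s^2\varepsilon - p) = 2\pi G a^2 c_s^2\varepsilon_b$, with the common factor $2\pi G a^2$ divided out.

```python
def sound_speed_squared(eps_b, eps_gamma):
    """c_s^2 from (7.85) for cold baryons tightly coupled to radiation."""
    return 1.0 / (3.0 * (1.0 + 0.75 * eps_b / eps_gamma))


def pressure_perturbation(d_eps, ds_over_s, eps_b, eps_gamma):
    """delta p from (7.84): adiabatic part plus entropy part (tau*dS = c_s^2 eps_b dS/S)."""
    cs2 = sound_speed_squared(eps_b, eps_gamma)
    return cs2 * d_eps + cs2 * eps_b * ds_over_s


# Hypothetical background values and perturbations, in arbitrary units.
eps_b, eps_gamma = 2.0, 5.0
d_eps_gamma, d_eps_b = 1.0e-4, -3.0e-5

d_eps = d_eps_gamma + d_eps_b                                   # (7.82)
ds_over_s = 0.75 * d_eps_gamma / eps_gamma - d_eps_b / eps_b    # (7.83)
dp_direct = d_eps_gamma / 3.0                                   # (7.81)
dp_formula = pressure_perturbation(d_eps, ds_over_s, eps_b, eps_gamma)

# Coefficient of Phi in (7.51), divided by 2*pi*G*a^2 on both sides.
cs2 = sound_speed_squared(eps_b, eps_gamma)
coefficient_lhs = 4.0 * (cs2 * (eps_b + eps_gamma) - eps_gamma / 3.0)
coefficient_rhs = cs2 * eps_b

print(dp_direct, dp_formula, coefficient_lhs, coefficient_rhs)
```

With the coefficient of $\Phi$ equal to $2\pi G a^2c_s^2\varepsilon_b$ and the source equal to $4\pi G a^2c_s^2\varepsilon_b\,\delta S/S$, the constant solution $\Phi = 2\,\delta S/S$ follows at once.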

Physically, the general solution of (7.51), when $\delta S \neq 0$, describes a mixture
of adiabatic and entropy modes. How to distinguish between them is a matter of
definition. Based on the intuitive idea that in the very early universe the entropy
mode should describe inhomogeneities in the baryon distribution on a nearly
homogeneous radiation background, we define the entropy mode to have the initial
condition

$$\Phi \to 0 \quad \text{as} \quad \eta \to 0. \tag{7.87}$$

Obviously, the particular solution (7.86) does not satisfy this condition. The solution
which does is obtained by adding to (7.86) a general solution (7.71) and choosing
the integration constants so that (7.87) is satisfied. The result is

$$\Phi = \frac{1}{5}\xi\frac{\xi^2 + 6\xi + 10}{(\xi + 2)^3}\frac{\delta S}{S}. \tag{7.88}$$

Problem 7.14 Calculate $\overline{\delta\varepsilon}/\varepsilon$ and $\overline{\delta\varepsilon}_b/\varepsilon_b$.
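The construction of the entropy mode can be checked directly: adding the nondecaying mode of (7.71) to the particular solution (7.86) should reproduce (7.88). The sketch below assumes the integration constants $A = -3\,\delta S/S$ and $B = 0$; these values are not quoted in the text and are simply the choice for which the two expressions coincide. Function names are illustrative.

```python
def phi_entropy(xi, ds_over_s=1.0):
    """Entropy mode (7.88); xi = eta / eta_star."""
    return 0.2 * xi * (xi ** 2 + 6.0 * xi + 10.0) / (xi + 2.0) ** 3 * ds_over_s


def phi_adiabatic_mode(xi, amplitude):
    """Nondecaying mode of (7.71) with B = 0."""
    bracket = 0.6 * xi ** 2 + 3.0 * xi + 1.0 / (xi + 1.0) + 13.0 / 3.0
    return amplitude * (xi + 1.0) / (xi + 2.0) ** 3 * bracket


def phi_from_superposition(xi, ds_over_s=1.0):
    """Particular solution (7.86) plus the adiabatic mode with A = -3 dS/S (assumed)."""
    return 2.0 * ds_over_s + phi_adiabatic_mode(xi, -3.0 * ds_over_s)


differences = [abs(phi_entropy(xi) - phi_from_superposition(xi))
               for xi in (0.1, 1.0, 10.0)]

early_slope = phi_entropy(1.0e-4) / 1.0e-4   # linear growth before equality
late_value = phi_entropy(1.0e6)              # plateau after equality

print(differences, early_slope, late_value)
```

The potential grows as $\Phi \simeq (\xi/4)\,\delta S/S$ at early times and saturates at $\Phi \to (1/5)\,\delta S/S$ at late times.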


In Figure 7.3 we plot the time dependence of $\Phi$, $\overline{\delta\varepsilon}/\varepsilon$ and $\overline{\delta\varepsilon}_b/\varepsilon_b$ for the entropy
mode. The amplitudes of $\Phi$ and $\overline{\delta\varepsilon}/\varepsilon$ increase linearly until $\eta_{eq}$, whereas they are
constant for adiabatic perturbations. The fluctuations in the cold-matter density
$\overline{\delta\varepsilon}_b/\varepsilon_b$ are somewhat frozen before $\eta_{eq}$ and decrease to 2/5 of their initial values
by the time of equality. For $\eta > \eta_{eq}$, the entropy perturbations evolve like the
nondecaying mode of the adiabatic perturbations. There is a key difference, however:
it follows from (7.83) that for adiabatic perturbations ($\delta S = 0$) we always have

$$\frac{\overline{\delta\varepsilon}_\gamma}{\varepsilon_\gamma} = \frac{4}{3}\frac{\overline{\delta\varepsilon}_b}{\varepsilon_b}, \tag{7.89}$$

whereas for entropy perturbations

$$\frac{\overline{\delta\varepsilon}_\gamma}{\varepsilon_\gamma} \simeq -2\frac{\overline{\delta\varepsilon}_b}{\varepsilon_b} \tag{7.90}$$

after equality.

Fig. 7.3.

We can also define the isocurvature mode of perturbations by imposing the
conditions $\Phi_i = 0$ and $\Phi_i' = 0$ at some initial moment of time $\eta_i \neq 0$. One can
easily verify that this mode soon approaches the entropy mode.


7.3.2 Vector and tensor perturbations 


For a perfect fluid, the only nonvanishing vector components of $\delta T^\alpha_\beta$ are
$\delta T^0_i = a^{-1}(\varepsilon_0 + p_0)\delta u_{\perp i}$. Equations (7.45) become

$$\Delta\overline{V}_i = 16\pi G a(\varepsilon_0 + p_0)\delta u_{\perp i}, \tag{7.91}$$

$$(\overline{V}_{i,j} + \overline{V}_{j,i})' + 2\mathcal{H}(\overline{V}_{i,j} + \overline{V}_{j,i}) = 0. \tag{7.92}$$

The solution of the second equation is

$$\overline{V}_i = \frac{C_{\perp i}}{a^2}, \tag{7.93}$$

where $C_{\perp i}$ is a constant of integration. Taking into account that the physical velocity
is $\delta v^i = a(dx^i/ds) = -a^{-1}\delta u_{\perp i}$, we obtain

$$\delta v \propto \frac{1}{a^4(\varepsilon_0 + p_0)}. \tag{7.94}$$

Thus, in a matter-dominated universe, the rotational velocities decay in inverse
proportion to the scale factor, in agreement with the result of Newtonian theory. In
a radiation-dominated universe $\delta v = \text{const}$. In both cases the metric perturbations
given by (7.93) decay very quickly and the primordial vector fluctuations have
significant amplitudes at present only if they were originally very large. There is
no reason to expect such large primordial vector perturbations and from now on we
will completely ignore them.
Tensor perturbations are more interesting, and as we will see in the following
chapter, they can be generated during an inflationary stage. In a hydrodynamical
universe with perfect fluid $\delta T^{i}_{j\,(T)} = 0$, (7.43) simplifies to

$$h_{ij}'' + 2\mathcal{H}h_{ij}' - \Delta h_{ij} = 0. \tag{7.95}$$

Introducing the rescaled variable $v$ via

$$h_{ij} = \frac{v}{a}e_{ij}, \tag{7.96}$$

where $e_{ij}$ is a time-independent polarization tensor, and considering a plane wave
perturbation with the wavenumber $k$, (7.95) becomes

$$v'' + \left(k^2 - \frac{a''}{a}\right)v = 0. \tag{7.97}$$

In a radiation-dominated universe $a \propto \eta$, hence $a'' = 0$ and $v \propto \exp(\pm ik\eta)$. In
this case the exact solution of (7.95) is

$$h_{ij} = \frac{1}{\eta}\left(C_1\sin(k\eta) + C_2\cos(k\eta)\right)e_{ij}. \tag{7.98}$$

The nondecaying mode of the gravitational wave with wavelength larger than the
Hubble scale ($k\eta \ll 1$) is constant. After the wavelength becomes smaller than
the Hubble radius, the amplitude decays in inverse proportion to the scale factor.
This is a general result valid for any equation of state. In fact, for long-wavelength
perturbations with $k\eta \ll 1$, we can neglect the $k^2$ term in (7.97) and its solution
becomes

$$v \simeq C_1a + C_2a\int\frac{d\eta}{a^2}. \tag{7.99}$$

Hence,

$$h_{ij} = \left(C_1 + C_2\int\frac{d\eta}{a^2}\right)e_{ij}. \tag{7.100}$$

For $p < \varepsilon$ the second term describes the decaying mode.
For short-wavelength perturbations ($k\eta \gg 1$), we have $k^2 \gg a''/a$ and
$h \propto \exp(\pm ik\eta)/a$.
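One can confirm numerically that (7.98) solves (7.95) in the radiation-dominated case, where $\mathcal{H} = 1/\eta$ and the Laplacian acts as $-k^2$ on a plane wave. The following sketch uses central finite differences; the step size, the sample values of $\eta$ and $k$, and the function names are arbitrary illustrative choices.

```python
import math


def h_radiation(eta, k, c1=1.0, c2=0.0):
    """Amplitude of (7.98) for a radiation-dominated universe (a proportional to eta)."""
    return (c1 * math.sin(k * eta) + c2 * math.cos(k * eta)) / eta


def residual(eta, k, step=1.0e-4):
    """Left-hand side of (7.95) for a plane wave, using central differences."""
    h_plus = h_radiation(eta + step, k)
    h_zero = h_radiation(eta, k)
    h_minus = h_radiation(eta - step, k)
    first = (h_plus - h_minus) / (2.0 * step)
    second = (h_plus - 2.0 * h_zero + h_minus) / step ** 2
    conformal_hubble = 1.0 / eta  # H = a'/a for a proportional to eta
    return second + 2.0 * conformal_hubble * first + k ** 2 * h_zero


residuals = [abs(residual(eta, k=3.0)) for eta in (0.5, 2.0, 10.0)]

# Superhorizon limit: sin(k*eta)/eta tends to the constant k.
superhorizon_value = h_radiation(1.0e-3, k=1.0)

print(residuals, superhorizon_value)
```

The residuals vanish up to discretization error, and for $k\eta \ll 1$ the $C_1$ mode is constant, while for $k\eta \gg 1$ the envelope falls as $1/\eta \propto 1/a$.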


Problem 7.15 Find the exact solution of (7.95) for an arbitrary constant equation
of state $p = w\varepsilon$ and analyze the behavior of the short- and long-wavelength
gravitational waves. Consider separately the cases $p = \pm\varepsilon$.


7.4 Baryon-radiation plasma and cold dark matter 


Understanding the perturbations in a multi-component medium consisting of a mixture
of a baryon-radiation plasma and cold dark matter is important both to analyze
the anisotropy of the cosmic microwave background and to determine the transfer
function relating the primordial spectrum of density inhomogeneities created during
inflation to the spectrum after matter-radiation equality. As a prelude to analysis of
the microwave background anisotropies, we consider in this section the calculation
of the gravitational potential and the radiation fluctuations at recombination.

Before recombination, baryons are strongly coupled to radiation and the baryon— 
radiation component can be treated in the hydrodynamical approximation as a 
single imperfect fluid. The other component consists of heavy cold particles, which 
we assume interact only gravitationally with the baryon—photon plasma and are 
otherwise free to move with respect to it. 

We assume that the number of photons per cold dark matter particle is initially
spatially uniform on supercurvature scales (wavelengths greater than $H^{-1}$) but that
the matter and radiation densities vary in space. In other words, we consider
adiabatic perturbations. As the universe expands and the inhomogeneity scale becomes
smaller than the curvature scale, the components move with respect to one another
and the entropy (number of photons) per cold dark matter particle varies spatially.
In contrast, the entropy per baryon remains spatially uniform on all scales until the
baryons decouple from the radiation.


7.4.1 Equations 


Because the baryon-radiation and cold dark matter components interact only
gravitationally, their energy-momentum tensors satisfy the conservation laws, $T^\alpha_{\beta;\alpha} = 0$,
separately. The cold particles have negligible relative velocities and hence the cold
dark matter can be described as a dust-like perfect fluid with zero pressure. In the
baryon-radiation plasma, the photons can efficiently transfer energy from one region
of the fluid to another over distances determined by the mean free path of the
photons (e.g. through diffusion). Shear viscosity and heat conduction play an important
role in this case and lead to the dissipation of perturbations on small scales
(Silk damping). In the limit of low baryon density, corresponding to current observations,
heat conduction is not as important as shear viscosity, and so we will only
consider the latter. The derivation of the energy-momentum tensor for an imperfect
fluid is given in many books and we will not repeat it here. The energy-momentum
tensor is found to be

$$T^\alpha_\beta = (\varepsilon + p)u^\alpha u_\beta - p\,\delta^\alpha_\beta - \eta\left(P^\alpha_\gamma u_\beta^{\ ;\gamma} + P^\gamma_\beta u^\alpha_{\ ;\gamma} - \frac{2}{3}P^\alpha_\beta u^\gamma_{\ ;\gamma}\right), \tag{7.101}$$

where $\eta$ is the shear viscosity coefficient and

$$P^\alpha_\beta \equiv \delta^\alpha_\beta - u^\alpha u_\beta$$

is the projection operator. We will analyze small perturbations using the conformal
Newtonian coordinate system in which the metric takes the form

$$ds^2 = a^2(\eta)\left[(1 + 2\Phi)\,d\eta^2 - (1 - 2\Psi)\delta_{ij}\,dx^i dx^j\right]. \tag{7.102}$$


The potentials $\Phi$ and $\Psi$ are equal only if the nondiagonal spatial components of
the energy-momentum tensor are equal to zero. This is obviously not true for an
imperfect fluid. However, one can easily check that even when the main contribution
to the gravitational potential comes from the imperfect fluid, the difference $\Phi - \Psi$
is suppressed compared to $\Phi$, at least by the ratio of the photon mean free path
to the perturbation scale. After equality, the gravitational potential is mainly due
to the cold dark matter and the contribution of the nondiagonal components of the
energy-momentum tensor can be completely neglected. Therefore we set $\Psi = \Phi$.
In this case the Christoffel symbols are

$$\Gamma^0_{00} = \mathcal{H} + \Phi'; \quad \Gamma^0_{0i} = \Gamma^i_{00} = \Phi_{,i}; \quad \Gamma^0_{ij} = \left((1 - 4\Phi)\mathcal{H} - \Phi'\right)\delta_{ij};$$

$$\Gamma^i_{j0} = (\mathcal{H} - \Phi')\delta_{ij}; \quad \Gamma^i_{jk} = \Phi_{,i}\delta_{jk} - \Phi_{,j}\delta_{ik} - \Phi_{,k}\delta_{ij}, \tag{7.103}$$

and the zero component of the 4-velocity to first order in perturbations is equal to

$$u^0 = \frac{1}{a}(1 - \Phi), \qquad u_0 = a(1 + \Phi). \tag{7.104}$$


Given these relations, it is easy to show that, to zeroth order in the perturbations,
the relation $T^\alpha_{0;\alpha} = 0$ reduces to the homogeneous energy-momentum conservation
law and $T^\alpha_{i;\alpha} = 0$ is trivially satisfied. To the next order, the equation $T^\alpha_{0;\alpha} = 0$ leads
to

$$\delta\varepsilon' + 3\mathcal{H}(\delta\varepsilon + \delta p) - 3(\varepsilon + p)\Phi' + a(\varepsilon + p)u^i_{,i} = 0. \tag{7.105}$$

Note that the shear viscosity does not appear in this relation. As for the remaining
equations, $T^\alpha_{i;\alpha} = 0$, if we are only interested in scalar perturbations, it suffices to
take the spatial divergence. We find

$$\frac{1}{a^4}\left(a^5(\varepsilon + p)u^i_{,i}\right)' - \frac{4}{3}\eta\Delta u^i_{,i} + \Delta\delta p + (\varepsilon + p)\Delta\Phi = 0. \tag{7.106}$$

As already noted, the two equations above are separately valid for the dark matter
and the baryon-radiation plasma components.


Problem 7.16 Derive (7.105) and (7.106). 


Dark matter  For dark matter, the pressure $p$ and the shear viscosity $\eta$ are both
equal to zero. Taking into account that $\varepsilon_d a^3 = \text{const}$, we obtain from (7.105) that
the fractional perturbation in the energy of dark matter component, $\delta_d \equiv \delta\varepsilon_d/\varepsilon_d$,
satisfies the equation

$$(\delta_d - 3\Phi)' + au^i_{,i} = 0. \tag{7.107}$$

Solving for $u^i_{,i}$ in terms of $\delta_d$ and $\Phi$ and substituting into (7.106), the resulting
equation takes the form

$$\left(a(\delta_d - 3\Phi)'\right)' - a\Delta\Phi = 0. \tag{7.108}$$


Baryon-radiation plasma  The baryons and radiation are tightly coupled before
recombination and, therefore, their energy and momentum are not conserved separately.
Nevertheless, when the baryons are nonrelativistic, (7.105), in contrast to
(7.106), is valid for the baryon and radiation components separately, because the
energy conservation law for the baryons, $T^\alpha_{0;\alpha} = 0$, reduces to the conservation law
for total baryon number. (Specifically, if $T^\alpha_0 = m_bn_bu^\alpha u_0$, where $m_b$ is the baryon
mass, then $T^\alpha_{0;\alpha} = 0$ is equivalent to $(n_bu^\alpha)_{;\alpha} = 0$ up to linear order in perturbations.)
Hence, the fractional baryon density fluctuation, $\delta_b \equiv \delta\varepsilon_b/\varepsilon_b$, satisfies an equation
similar to (7.107):

$$(\delta_b - 3\Phi)' + au^i_{,i} = 0. \tag{7.109}$$

The corresponding equation for the perturbations in the radiation component,
$\delta_\gamma \equiv \delta\varepsilon_\gamma/\varepsilon_\gamma$, is, according to (7.105),

$$(\delta_\gamma - 4\Phi)' + \frac{4}{3}au^i_{,i} = 0. \tag{7.110}$$

Since the photons and baryons are tightly coupled, they move together, and hence
the velocities entering both of these equations are the same. Multiplying (7.110) by
3/4, subtracting (7.109), and integrating, we obtain

$$\frac{\delta S}{S} \equiv \frac{3}{4}\delta_\gamma - \delta_b = \text{const}, \tag{7.111}$$

where $\delta S/S$ is the fractional entropy fluctuation in the baryon-radiation plasma
(see also (7.83)). Equation (7.111) states that $\delta S/S$ is conserved on all scales. In
the case of adiabatic perturbations, $\delta S = 0$ and therefore

$$\delta_b = \frac{3}{4}\delta_\gamma. \tag{7.112}$$


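Expressing $u^i_{,i}$ in terms of $\delta_\gamma$ and $\Phi$ and substituting into (7.106), we obtain

$$\left(\frac{\delta_\gamma'}{c_s^2}\right)' - \frac{3\eta}{\varepsilon_\gamma a}\Delta\delta_\gamma' - \Delta\delta_\gamma = \frac{4}{3c_s^2}\Delta\Phi + \left(\frac{4\Phi'}{c_s^2}\right)' - \frac{12\eta}{\varepsilon_\gamma a}\Delta\Phi', \tag{7.113}$$

where $\Delta$ is the Laplacian operator and $c_s$ is the speed of sound in the
baryon-radiation plasma given in (7.85). In deriving (7.113), we used the relations

$$\varepsilon + p = \varepsilon_b + \frac{4}{3}\varepsilon_\gamma = \frac{4}{9c_s^2}\varepsilon_\gamma \tag{7.114}$$

and $\varepsilon_\gamma a^4 = \text{const}$. Neglecting polarization effects, the shear viscosity coefficient
entering (7.113) is given by

$$\eta = \frac{4}{15}\varepsilon_\gamma\tau_\gamma, \tag{7.115}$$

where $\tau_\gamma$ is the mean free time for photon scattering.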

We have, thus far, two perturbation equations, (7.108) and (7.113), for three unknown
variables, $\delta_d$, $\delta_\gamma$ and $\Phi$. To these equations, we add the 0-0 component of
the Einstein equations (see (7.38)), which in the case under consideration becomes

$$\Delta\Phi - 3\mathcal{H}\Phi' - 3\mathcal{H}^2\Phi = 4\pi G a^2(\delta\varepsilon_d + \delta\varepsilon_b + \delta\varepsilon_\gamma) = 4\pi G a^2\left(\varepsilon_d\delta_d + \frac{1}{3c_s^2}\varepsilon_\gamma\delta_\gamma\right), \tag{7.116}$$

where (7.112) has been used to express $\delta_b$ in terms of $\delta_\gamma$.
Using (7.110), we obtain a useful relation for the radiation contribution to the
divergence of the 0-i components of the energy-momentum tensor:

$$T^i_{0,i} = \frac{4}{3}\varepsilon_\gamma u_0u^i_{,i} = (4\Phi - \delta_\gamma)'\varepsilon_\gamma, \tag{7.117}$$

which will be used in Section 9.3.


7.4.2 Evolution of perturbations and transfer functions 


If the density fluctuations are decomposed into modes with comoving wavenumber
$k$, then their behavior for a given $k$ depends on whether $k\eta < 1$ or $k\eta > 1$. The
crossover from $k\eta < 1$ to $k\eta > 1$ corresponds to the transition in which a mode changes
from having a wavelength exceeding the curvature scale to having a wavelength
less than the curvature scale. In a decelerating universe, as time evolves and $\eta$
grows, the curvature scale $H^{-1} = a/\dot{a}$ increases faster than the physical scale of
the perturbation, $\lambda_{ph} \sim a/k$, and encompasses modes with smaller and smaller $k$.
We shall refer to modes for which $k\eta < 1$ as supercurvature modes and those with
$k\eta > 1$ as subcurvature ones.

The initial perturbation spectrum produced in inflation can be characterized by
the "frozen" amplitudes of the metric fluctuations $\Phi^0_k$ on supercurvature scales during
the radiation stage (see the following chapter for details). After the perturbation
enters the curvature scale it evolves in a nontrivial way. Our goal here is to determine
the amplitude of the gravitational potential $\Phi$ and the radiation fluctuations $\delta_\gamma$ at
recombination for a given initial $\Phi^0_k$ or, equivalently, to find the transfer functions
relating the initial spectrum of perturbations with the resulting one at recombination.
We will see in Chapter 9 that $\Phi$ and $\delta_\gamma$ determine the anisotropies in the
background radiation.


Long-wavelength perturbations ($k\eta_r < 1$)  We first consider the long-wavelength
perturbations which are supercurvature modes at recombination. They are described
by solution (7.71) for adiabatic perturbations in a two-component medium consisting
of cold matter and radiation. Although the dark matter particles are not tightly
coupled to the radiation, the entropy per cold dark matter particle is nevertheless
conserved on supercurvature scales because, as can be seen intuitively, there is
insufficient time to move matter distances greater than the Hubble scale. One can
formally arrive at this conclusion by noting that, for long-wavelength perturbations,
the $u^i_{,i}$-terms in (7.107) and (7.110) are negligible and, consequently, the steps which
lead to the entropy conservation law per baryon (see (7.111)) can be repeated here.

Knowing the gravitational potential, which is given by (7.71), we can easily find
$\delta_\gamma$. Skipping the velocity term in (7.110), which is negligible for the long-wavelength
perturbations, and integrating, we obtain

$$\delta_\gamma - 4\Phi = C, \tag{7.118}$$

where $C$ is a constant of integration. To determine $C$ we note that, during the
radiation-dominated epoch, the gravitational potential is mainly due to the fluctuations
in the radiation component and stays constant on supercurvature scales. At these
early times,

$$\delta_\gamma \simeq -2\Phi(\eta \ll \eta_{eq}) \equiv -2\Phi^0; \tag{7.119}$$

hence $C = -6\Phi^0$. After equality, when the cold matter overtakes the radiation, the
gravitational potential $\Phi$ changes its value by a factor of 9/10 and remains constant
afterwards, that is,

$$\Phi(\eta \gg \eta_{eq}) \simeq \frac{9}{10}\Phi^0. \tag{7.120}$$

Therefore, assuming that cold dark matter dominates at recombination, we obtain
from (7.118)

$$\delta_\gamma(\eta_r) = -6\Phi^0 + 4\Phi(\eta_r) = -\frac{8}{3}\Phi(\eta_r) = -\frac{8}{3}\left(\frac{9}{10}\Phi^0\right). \tag{7.121}$$

One arrives at the same conclusion by noting that, for adiabatic perturbations,
$\delta_\gamma = 4\delta_d/3$ and $\delta_d \simeq -2\Phi(\eta_r)$ at recombination.
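The arithmetic behind (7.121) can be followed step by step with exact rational numbers. In this small sketch all quantities are measured in units of the initial potential $\Phi^0$; the variable names are ours.

```python
from fractions import Fraction

phi_initial = Fraction(1)                           # Phi^0, in units of itself
integration_constant = -6 * phi_initial             # C in (7.118), fixed by (7.119)
phi_recombination = Fraction(9, 10) * phi_initial   # (7.120)

# (7.118) evaluated at recombination: delta_gamma = C + 4*Phi
delta_gamma = integration_constant + 4 * phi_recombination
ratio_to_potential = delta_gamma / phi_recombination

# Cross-check via delta_gamma = (4/3)*delta_d with delta_d = -2*Phi(eta_r)
delta_gamma_alternative = Fraction(4, 3) * (-2 * phi_recombination)

print(delta_gamma, ratio_to_potential, delta_gamma_alternative)
```

Both routes give $\delta_\gamma(\eta_r) = -\tfrac{12}{5}\Phi^0 = -\tfrac{8}{3}\Phi(\eta_r)$.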

Standard inflation predicts adiabatic fluctuations. In principle, one can imagine
alternative possibilities for the initial inhomogeneities, such as entropy
perturbations. For example, the dark matter might initially be distributed
inhomogeneously on a homogeneous radiation background. It is clear that, at early
times when radiation dominates, $\delta_\gamma$ and $\Phi$ both vanish and the constant of integration
on the right hand side of (7.118) is equal to zero. After equality, the dark matter
inhomogeneities induce the gravitational potential. Then, it follows from (7.118)
that the fluctuations in the radiation component on supercurvature scales are equal
to $\delta_\gamma = 4\Phi$, where $\Phi$ is mainly due to the cold dark matter fluctuations (see also
(7.90)). The differences between this case with entropic perturbations and the
adiabatic case give rise to distinctive cosmic microwave background anisotropies.


Short-wavelength perturbations ($k\eta_r > 1$)  We next consider the subcurvature
modes which enter the horizon before recombination. These perturbations are
especially interesting since they are responsible for the acoustic peaks in the cosmic
microwave background spectrum.

To simplify the consideration, we neglect the contribution of baryons to the grav- 
itational potential. This approximation is valid in realistic models where the baryons 
constitute only a small fraction of the total matter density. Although we neglect the 
contribution of the baryons to the gravitational potential, they cannot be completely 
ignored since they substantially affect the speed of sound after equality. 

In general, there exist four independent instability modes in the two-component 
medium. The set of equations for the perturbations is rather complicated and they 
cannot be solved analytically without making further approximations. Let us con- 
sider the evolution of perturbations after equality, at 7 > meq. In this case the prob- 
lem is greatly simplified if one notes that at 7 > n-, the gravitational potential 
is mainly due to the perturbations in the cold dark matter and is therefore time- 
independent for both the long-wavelength and the short-wavelength perturbations. 
Thus, at 7 > 1q, the potential ® can be considered as an external source in (7.113) 
and the general solution of this equation can be written as a sum of the general 
solution of the homogeneous equation (with ® = 0) and a particular solution for 
6,. Using the variable x defined by dx = c2dy and taking into account that the time 
derivatives of the potential on the right hand side of (7.113) are zero (® = const), 
(7.113) becomes 


5, 4t, db, 1 4 
= Sey. ee 
dx? 5a dx ct ”  3c4 





AQ, (7.122) 


where the second term is due to the viscosity. If the speed of sound is slowly varying, 
(7.122) has an obvious approximate particular solution: 
4 


by Xz 50. (7.123) 
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To find the general solution of the homogeneous equation (7.122) we employ the 
WKB approximation. Let us consider a plane wave perturbation with wavenumber 
k and introduce the new variable 


200 f 
ye(X) = 5, (k, x) exp zk [tax : (7.124) 


Then, it follows from (7.122) that the variable y, satisfies the equation 


ay, ke Ae kr Ne ce past 
ae 1-3 (=) 7 a) Ut) 





First note that, for a perturbation whose physical wavelength (Ap, ~ a/k) is much 
larger than the mean free path of the photons (~ t,,), the second term in the square 
brackets is negligible. The third term is roughly t,/an ~ t,/t < 1 and can also 
be skipped. With these simplifications, the WKB solution for y is 


d 
ye & Angles cos(t =). (7.126) 
Cs 


where $A_k$ is a constant of integration and the indefinite integral implies that the
argument of the cosine function includes an arbitrary phase. Given the definition
of $y_k$ in (7.124) and combining this solution with (7.123), we obtain the general
approximate solution of (7.122), valid at $\eta > \eta_{eq}$:

$$\delta_\gamma(k, \eta) \simeq -\frac{4}{3c_s^2}\Phi_k(\eta > \eta_{eq}) + A_k\sqrt{c_s}\cos\left(k\int c_s\,d\eta\right)e^{-(k/k_D)^2}. \tag{7.127}$$

Here we have converted back from $x$ to conformal time $\eta$ and introduced the
dissipation scale corresponding to the comoving wavenumber:

$$k_D(\eta) \equiv \left(\frac{2}{5}\int_0^\eta c_s^2\frac{\tau_\gamma}{a}\,d\tilde\eta\right)^{-1/2}. \tag{7.128}$$

In the limit of constant speed of sound and vanishing viscosity, the solution is
exact and valid not only for the short-wavelength perturbations but also for the
perturbations with $k\eta \ll 1$.
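
The WKB amplitude can be checked numerically. The following Python sketch is only
an illustration under stated assumptions, not part of the derivation: it integrates the
simplified equation $d^2y/dx^2 + (k/c_s)^2y = 0$ with a fourth-order Runge-Kutta scheme
and compares the envelope of the numerical solution with the factor $\sqrt{c_s}$ of (7.126).
The linear profile `c_s(x)`, its parameters `c0` and `eps`, the wavenumber and the step
size are hypothetical choices made for the example.

```python
import math

def c_s(x, c0=1.0 / math.sqrt(3.0), eps=-0.1):
    """Hypothetical slowly varying sound speed (illustrative profile only)."""
    return c0 * (1.0 + eps * x)

def integrate(k, x_end=1.0, h=1.0e-4):
    """RK4 integration of y'' + (k / c_s)^2 y = 0 from x = 0 to x_end."""
    def rhs(x, y, v):
        return v, -(k / c_s(x)) ** 2 * y

    n_steps = int(round(x_end / h))
    # Start at a crest with unit WKB amplitude: y = sqrt(c_s), y' = 0.
    x, y, v = 0.0, math.sqrt(c_s(0.0)), 0.0
    for _ in range(n_steps):
        k1y, k1v = rhs(x, y, v)
        k2y, k2v = rhs(x + h / 2, y + h / 2 * k1y, v + h / 2 * k1v)
        k3y, k3v = rhs(x + h / 2, y + h / 2 * k2y, v + h / 2 * k2v)
        k4y, k4v = rhs(x + h, y + h * k3y, v + h * k3v)
        y += h / 6 * (k1y + 2 * k2y + 2 * k3y + k4y)
        v += h / 6 * (k1v + 2 * k2v + 2 * k3v + k4v)
        x += h
    return x, y, v

def wkb_amplitude_ratio(k):
    """Numerical envelope at x_end divided by the WKB prediction sqrt(c_s)."""
    x, y, v = integrate(k)
    omega = k / c_s(x)
    envelope = math.sqrt(y * y + (v / omega) ** 2)
    return envelope / math.sqrt(c_s(x))
```

For a rapidly oscillating mode such as `wkb_amplitude_ratio(200.0)` the ratio should
stay close to unity, with deviations of order $c_s'/(kc_s)$, which is the regime in
which the WKB solution is expected to hold.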


Problem 7.17 Find the corrections to (7.127) and determine when the WKB solu- 
tion is applicable. 


Silk damping From (7.127), it is clear that the viscosity efficiently damps
perturbations on comoving scales $\lambda \leq 1/k_D$. Viscous damping is due to the scattering
and mixing of the photons. Therefore, the scale where the viscous damping is important
is of order the photon diffusion scale at a given cosmological time $t$. To estimate
the diffusion scale, we note that the photons undergo about $N \sim t/\tau_\gamma$ scatterings
during time $t$. After every scattering the direction of propagation is completely
random, so the photon trajectory is similar to that of a “drunken sailor.” Therefore,
after $N$ steps of length $\tau_\gamma$, the typical distance travelled (the diffusion scale) is
about $\tau_\gamma\sqrt{N} \sim \sqrt{\tau_\gamma t}$. Consequently, the ratio of the physical damping scale
to the horizon scale is

$$\frac{\lambda_D^{ph}}{t} \sim (k_D\eta)^{-1} \sim \sqrt{\frac{\tau_\gamma}{t}}. \tag{7.129}$$


This simple estimate is in agreement with the more rigorous result (7.128). 
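
The random-walk scaling behind (7.129) can be illustrated with a short Monte Carlo
sketch. This is a toy model under simple assumptions (isotropic scattering, a fixed
step length equal to the mean free path, no expansion); the function name, the number
of walkers and the seed are hypothetical choices for the example.

```python
import math
import random

def rms_distance(n_steps, n_walkers=1000, step=1.0, seed=1):
    """RMS end-to-end distance of 3D random walks with a fixed step length."""
    rng = random.Random(seed)
    total_r2 = 0.0
    for _walker in range(n_walkers):
        x = y = z = 0.0
        for _step in range(n_steps):
            # Isotropic direction: uniform cos(theta) and uniform azimuth.
            mu = rng.uniform(-1.0, 1.0)
            phi = rng.uniform(0.0, 2.0 * math.pi)
            s = math.sqrt(1.0 - mu * mu)
            x += step * s * math.cos(phi)
            y += step * s * math.sin(phi)
            z += step * mu
        total_r2 += x * x + y * y + z * z
    return math.sqrt(total_r2 / n_walkers)
```

With `step` playing the role of $\tau_\gamma$, the value of `rms_distance(N)` should be
close to $\tau_\gamma\sqrt{N}$ up to statistical noise, which is the scaling used above.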
Before recombination, the mean free path of the photons is determined by
Thomson scattering on free electrons:

$$\tau_\gamma = \frac{1}{\sigma_Tn_e}, \tag{7.130}$$

where $\sigma_T \simeq 6.65 \times 10^{-25}\ \text{cm}^2$ is the Thomson cross-section and $n_e$ is the number
density of free electrons. We are mainly interested in the dissipation scale at
recombination time when the universe is dominated by cold dark matter. Taking into
account that $t_r \propto (\Omega_mh_{75}^2)^{-1/2}z_r^{-3/2}$ and $n_e \propto (\Omega_bh_{75}^2)z_r^3$, we infer from (7.129) that

$$(k_D\eta_r)^{-1} \sim (\sigma_Tn_et_r)^{-1/2} \propto (\Omega_mh_{75}^2)^{1/4}(\Omega_bh_{75}^2)^{-1/2}z_r^{-3/4}. \tag{7.131}$$


Problem 7.18 Using the exact formula (7.128) with $c_s^2 = 1/3$ and assuming
instantaneous recombination, calculate $k_D$ and show that

$$(k_D\eta_r)^{-1} \simeq 0.6\,(\Omega_mh_{75}^2)^{1/4}(\Omega_bh_{75}^2)^{-1/2}z_r^{-3/4}. \tag{7.132}$$
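
To get a feeling for the size of this ratio, (7.132) can be evaluated directly. In the
sketch below the function name and the input values ($\Omega_mh_{75}^2 = 0.3$,
$\Omega_bh_{75}^2 = 0.04$, $z_r = 1050$) are illustrative assumptions, not values taken
from the text.

```python
def silk_ratio(omega_m_h2, omega_b_h2, z_r):
    """(k_D * eta_r)^(-1) from (7.132); h is in units of 75 km/s/Mpc."""
    return 0.6 * omega_m_h2 ** 0.25 * omega_b_h2 ** -0.5 * z_r ** -0.75

# Illustrative parameter values (assumed for this example only).
ratio = silk_ratio(omega_m_h2=0.3, omega_b_h2=0.04, z_r=1050.0)
```

For these inputs the ratio is of order $10^{-2}$, so the damping scale at recombination
is well inside the horizon, consistent with the validity condition discussed next.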


The dissipation scale can never exceed the curvature scale $H^{-1} \sim t$, because
there is insufficient time for radiation to rearrange itself on those scales. This
imposes a limit on the range of validity of (7.132), namely, $(k_D\eta_r)^{-1} < 1$. If $\tau_\gamma$ grows
and begins to exceed the cosmological time $t$, then we have to use the kinetic
description for photons. Our analysis of viscous damping is also not valid on scales
smaller than the mean free path of the photons since the hydrodynamical description 
fails in this limit. On scales smaller than the mean free path, another effect, free 
streaming, becomes important. Free streaming refers to the propagation of photons 
without scattering. On scales smaller than the mean free path, photons coming 
from different directions with different temperatures intermingle, smearing spatial 
inhomogeneities in the radiation energy density distribution. However, in contrast 
with viscous damping, free streaming does not remove the angular temperature 
anisotropy of the radiation at a given point (see Problem 9.2). As with viscous 
damping, free streaming has no effect on scales larger than the horizon scale.

Transfer functions For long-wavelength perturbations with $k\eta_r \ll 1$, the
amplitudes of the metric and radiation fluctuations after equality in terms of $\Phi_k^0$ are
given in (7.120) and (7.121). To find the transfer functions for short-wavelength
inhomogeneities we have to express $\Phi_k(\eta > \eta_{eq})$ and $A_k$, entering (7.127), in terms of
$\Phi_k^0$. This can be done analytically in two limiting cases: for perturbations which enter
the horizon well after equality and for those which enter it well before equality, that
is, for the modes with $k\eta_{eq} \ll 1$ and $k\eta_{eq} \gg 1$, respectively.

The perturbation with $k\eta_{eq} \ll 1$ enters the horizon when the cold matter already
dominates and determines the gravitational potential. Therefore the gravitational
potential does not change and it is given by

$$\Phi_k(\eta > \eta_{eq}) = \frac{9}{10}\Phi_k^0. \tag{7.133}$$


The solution (7.127) for $\delta_\gamma$ is applicable even when the wavelength of the
perturbation exceeds the curvature scale (see Problem 7.17). After equality at
$\eta \gg \eta_{eq}$, the amplitude of $\delta_\gamma$ for the supercurvature modes with $k\eta \ll 1$,
according to (7.127), should be equal to $\delta_\gamma \simeq -8\Phi_k/3 = \text{const}$. Assuming that the
baryons have a negligible effect on the speed of sound at this time, so that
$c_s^2 \to 1/3$, we find that $A_k = 4\Phi_k/3^{3/4}$. As a result, after equality but before
recombination, we have

$$\delta_\gamma(\eta) = \left[-\frac{4}{3c_s^2} + \frac{4\sqrt{c_s}}{3^{3/4}}\cos\left(k\int_0^\eta c_s\,d\tilde\eta\right)e^{-(k/k_D)^2}\right]\left(\frac{9}{10}\Phi_k^0\right) \tag{7.134}$$

for the modes with $\eta_{eq}^{-1} \gg k \gg \eta_r^{-1}$.

Now we consider perturbations with $k\eta_{eq} \gg 1$, which enter the horizon well
before equality. At $\eta < \eta_{eq}$, radiation dominates over dark matter and baryons.
Therefore, $\Phi$ and $\delta_\gamma$ are well approximated by (7.61) and (7.62), describing
perturbations in the radiation-dominated universe. Neglecting the decaying mode on
supercurvature scales and expressing $C_1$ in terms of $\Phi_k^0$, we find that at
$\eta_{eq} > \eta \gg k^{-1}$

$$\delta_\gamma \simeq 6\Phi_k^0\cos(k\eta/\sqrt{3}), \qquad \Phi_k(\eta) \simeq -\frac{9\Phi_k^0}{(k\eta)^2}\cos(k\eta/\sqrt{3}). \tag{7.135}$$





To determine the fluctuations in the cold dark matter component, we integrate
(7.108) to obtain

$$\delta_d(\eta) = 3\Phi(\eta) + \int^\eta\frac{d\tilde\eta}{a}\int^{\tilde\eta}a\,\Delta\Phi\,d\bar\eta. \tag{7.136}$$


This is an exact relation which is valid for any $k$. During the radiation-dominated
epoch, the main contribution to the gravitational potential is due to radiation,
and, therefore, we can treat $\Phi$ in (7.136) as an external source given by (7.61).
The two constants of integration can be fixed by substituting (7.61) in (7.136) and
noting that, at earlier times when the wavelength of the mode exceeds the curvature
scale, one should match the result for the long-wavelength perturbations:

$$\delta_d \simeq 3\delta_\gamma/4 \simeq -3\Phi_k^0/2. \tag{7.137}$$


Problem 7.19 Determine the constants of integration in (7.136) and show that after
the perturbation enters the Hubble scale, but before equality,

$$\delta_d \simeq -9\left[C - \frac{1}{2} + \ln(k\eta/\sqrt{3}) + O\left((k\eta)^{-1}\right)\right]\Phi_k^0, \tag{7.138}$$

where $C = 0.577\ldots$ is the Euler constant.


It is easy to see from (7.116) that, before equality, the contribution of the dark
matter perturbations to the gravitational potential is suppressed by a factor $\varepsilon_d/\varepsilon_\gamma$
compared to the fluctuations in the radiation component. At equality, the dark matter
contribution begins to dominate and the density perturbation $\delta_d$ starts to grow $\propto \eta^2$,
as shown in Section 7.3.1. The gravitational potential “freezes” at the value

$$\Phi_k(\eta > \eta_{eq}) \sim -\frac{4\pi Ga^2\varepsilon_d}{k^2}\delta_d\bigg|_{\eta_{eq}} \sim O(1)\frac{\ln(k\eta_{eq})}{(k\eta_{eq})^2}\Phi_k^0 \tag{7.139}$$

and stays constant until recombination.
More work is required to obtain the exact coefficients in (7.139).


Problem 7.20 For short-wavelength perturbations, the time derivatives of the
gravitational potential in (7.108) and (7.116) can be neglected compared to the spatial
derivatives. From these relations, it follows that

$$(a\delta_d')' - 4\pi Ga^3\left(\varepsilon_d\delta_d + \frac{1}{3c_s^2}\varepsilon_\gamma\delta_\gamma\right) = 0. \tag{7.140}$$

Show that the second term here induces corrections to (7.138) that become
significant only near equality. These corrections are mainly due to the $\varepsilon_d\delta_d$
contribution. Show that the $\varepsilon_\gamma\delta_\gamma$ term remains negligible throughout and, hence, can
be omitted in (7.140). Then (7.140) coincides with the equation describing the
instability in the nonrelativistic cold matter component on a homogeneous radiation
background and its solution is given in (6.72). At $x \ll 1$ this solution should coincide
with (7.138). Considering this limit, show that the integration constants in (6.72) are

$$C_1 \simeq -9\left(\ln\left(\frac{2k\eta_*}{\sqrt{3}}\right) + C - \frac{7}{2}\right)\Phi_k^0, \qquad C_2 \simeq 9\Phi_k^0, \tag{7.141}$$

where $\eta_* = \eta_{eq}/(\sqrt{2} - 1)$. Neglecting the decaying mode and using the relation
between the gravitational potential and $\delta_d$, verify that

$$\Phi_k(\eta > \eta_{eq}) \simeq \frac{\ln(0.15k\eta_{eq})}{(0.27k\eta_{eq})^2}\Phi_k^0. \tag{7.142}$$
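
The numerical coefficients 0.15 and 0.27 can be cross-checked with a few lines of
Python. The sketch assumes the late-time, matter-dominated relations
$\delta_d \simeq \frac{3}{2}C_1(\eta/\eta_*)^2$ and $4\pi Ga^2\varepsilon_d \simeq 6/\eta^2$, so that
$\Phi_k \simeq -9C_1/(k\eta_*)^2$; these intermediate steps and the function name are
assumptions of the example rather than formulas quoted from the text.

```python
import math

EULER_C = 0.5772156649015329  # Euler constant C

def transfer_coefficients():
    """Return (alpha, beta) with Phi_k = ln(alpha*k*eta_eq)/(beta*k*eta_eq)^2 * Phi_k^0."""
    eta_star_over_eta_eq = 1.0 / (math.sqrt(2.0) - 1.0)
    # ln(2 k eta_* / sqrt(3)) + C - 7/2 = ln(alpha * k * eta_eq)
    alpha = 2.0 * eta_star_over_eta_eq / math.sqrt(3.0) * math.exp(EULER_C - 3.5)
    # 81 / (k eta_*)^2 = 1 / (beta * k * eta_eq)^2
    beta = eta_star_over_eta_eq / 9.0
    return alpha, beta
```

Under these assumptions the two returned numbers should round to the coefficients
appearing in (7.142).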

The fluctuations in the radiation component after equality continue to behave as
sound waves in the external gravitational potential given by (7.142). The integration
constant $A_k$ in (7.127) can be fixed by comparing the oscillating part of this solution
to the result in (7.135) at $\eta \sim \eta_{eq}$. Then we find that at $\eta > \eta_{eq}$,

$$\delta_\gamma \simeq \left[-\frac{4}{3c_s^2}\frac{\ln(0.15k\eta_{eq})}{(0.27k\eta_{eq})^2} + 3^{5/4}\sqrt{4c_s}\cos\left(k\int_0^\eta c_s\,d\eta\right)e^{-(k/k_D)^2}\right]\Phi_k^0 \tag{7.143}$$

for modes with $k\eta_{eq} \gg 1$.

It follows from (7.134) and (7.143) that, for $k > \eta_r^{-1}$, the spectrum of $\delta_\gamma$ at
recombination is partially modulated by the cosine. This is because all sound waves
with the same $k = |\mathbf{k}|$ enter the horizon and begin to oscillate simultaneously. As
we will see in the next chapter, this leads to peaks and valleys in the spectrum of
the temperature fluctuations of background radiation.

In summary, the results obtained in this section allow us to express the
gravitational potential and the radiation energy density fluctuations at recombination in
terms of the basic cosmic parameters and the primordial perturbation spectrum. The
primordial spectrum is described by the gravitational potential $\Phi_k^0$, characterizing
a perturbation with comoving wavenumber $k$ at very early times when its size still
exceeds the curvature scale. For modes whose wavelength exceeds the curvature
scale at recombination, the spectrum of $\Phi$ remains unchanged except that its
amplitude drops by a factor of 9/10 after matter-radiation equality, and the amplitude
of radiation fluctuations is given in (7.121). For perturbations whose wavelength is
less than the Hubble scale, we have derived asymptotic expressions for the modes
which enter the curvature scale well before equality (see (7.142), (7.143)), and
long enough after equality (see (7.133), (7.134)). For these perturbations the initial
spectrum is substantially changed as a result of evolution.


Inflation II: origin of the primordial inhomogeneities


One of the central issues of contemporary cosmology is the explanation of the 
origin of primordial inhomogeneities, which serve as the seeds for structure for- 
mation. Before the advent of inflationary cosmology the initial perturbations were 
postulated and their spectrum was designed to fit observational data. In this way 
practically any observation could be “explained”, or more accurately described, 
by arranging the appropriate initial conditions. In contrast, inflationary cosmology 
truly explains the origin of primordial inhomogeneities and predicts their spec- 
trum. Thus it becomes possible to test this theory by comparing its predictions with 
observations. 

According to cosmic inflation, primordial perturbations originated from quantum 
fluctuations. These fluctuations have substantial amplitudes only on scales close to 
the Planckian length, but during the inflationary stage they are stretched to galac- 
tic scales with nearly unchanged amplitudes. Thus, inflation links the large-scale 
structure of the universe to its microphysics. The resulting spectrum of inhomo- 
geneities is not very sensitive to the details of any particular inflationary scenario 
and has nearly universal shape. This leads to concrete predictions for the spectrum 
of cosmic microwave background anisotropies. 

In the previous chapter we studied gravitational instability in a universe filled 
with hydrodynamical matter. To understand the generation of primordial fluctua- 
tions we have to extend our analysis to the case of a scalar field condensate and 
quantize the cosmological perturbations. In this chapter we study the behavior of 
perturbations during an inflationary stage and calculate their resulting spectrum. 
We first consider a simple inflationary model and use the slow-roll approximation 
to solve the perturbation equations. Then the rigorous quantum theory is developed 
and applied to a general inflationary scenario.


8.1 Characterizing perturbations


At a given moment in time small inhomogeneities can be characterized by the spatial
distribution of the gravitational potential $\Phi$ or by the energy density fluctuations
$\delta\varepsilon/\varepsilon_0$. It turns out to be convenient to treat them as random fields, for which we
will use the common notation f(x). Subdividing an infinite universe into a set of 
large spatial regions we can consider a particular configuration f(x) within some 
region as a realization of a random process. This means that the relative number of 
regions where a given configuration f(x) occurs can be described by a probability 
distribution function. Then averaging over the statistical ensemble is equivalent to 
averaging over the volume of the whole infinite universe. 

It is convenient to describe the random process using Fourier methods. The 
Fourier expansion of the function f(x) in a given region of volume V can be 
written as 


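$$f(\mathbf{x}) = \frac{1}{\sqrt{V}}\sum_{\mathbf{k}}f_{\mathbf{k}}e^{i\mathbf{k}\mathbf{x}}. \tag{8.1}$$

In the case of a dimensionless function $f$ the complex Fourier coefficients,
$f_{\mathbf{k}} = a_{\mathbf{k}} + ib_{\mathbf{k}}$, have dimension $\text{cm}^{3/2}$. The reality of $f$ requires
$f_{-\mathbf{k}} = f_{\mathbf{k}}^*$ and hence the real and imaginary parts of $f_{\mathbf{k}}$ must satisfy the
constraints $a_{-\mathbf{k}} = a_{\mathbf{k}}$ and $b_{-\mathbf{k}} = -b_{\mathbf{k}}$. The coefficients $a_{\mathbf{k}}$ and $b_{\mathbf{k}}$ take
different values in different spatial regions. Given a very large number $N$ of such
regions, the definition of the probability distribution function $p(a_{\mathbf{k}}, b_{\mathbf{k}})$ tells us
that in

$$dN = Np(a_{\mathbf{k}}, b_{\mathbf{k}})\,da_{\mathbf{k}}\,db_{\mathbf{k}} \tag{8.2}$$

of them the value of $a_{\mathbf{k}}$ lies between $a_{\mathbf{k}}$ and $a_{\mathbf{k}} + da_{\mathbf{k}}$ and that of $b_{\mathbf{k}}$ between
$b_{\mathbf{k}}$ and $b_{\mathbf{k}} + db_{\mathbf{k}}$. Inflation predicts only homogeneous and isotropic Gaussian
processes, for which: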


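$$p(a_{\mathbf{k}}, b_{\mathbf{k}}) = \frac{1}{\pi\sigma_k^2}\exp\left(-\frac{a_{\mathbf{k}}^2}{\sigma_k^2}\right)\exp\left(-\frac{b_{\mathbf{k}}^2}{\sigma_k^2}\right), \tag{8.3}$$

where the variance depends only on $k = |\mathbf{k}|$; it is the same for both independent
variables $a_{\mathbf{k}}$ and $b_{\mathbf{k}}$, and is equal to $\sigma_k^2/2$. This variance characterizes the
corresponding Gaussian process entirely and all correlation functions can be expressed
in terms of $\sigma_k^2$. For example, for the expectation value of the product of Fourier
coefficients one finds

$$\langle f_{\mathbf{k}}f_{\mathbf{k}'}\rangle = \langle a_{\mathbf{k}}a_{\mathbf{k}'}\rangle + i\left(\langle a_{\mathbf{k}}b_{\mathbf{k}'}\rangle + \langle a_{\mathbf{k}'}b_{\mathbf{k}}\rangle\right) - \langle b_{\mathbf{k}}b_{\mathbf{k}'}\rangle = \sigma_k^2\delta_{\mathbf{k},-\mathbf{k}'}, \tag{8.4}$$

where we have taken into account that $a_{-\mathbf{k}} = a_{\mathbf{k}}$ and $b_{-\mathbf{k}} = -b_{\mathbf{k}}$. Here
$\delta_{\mathbf{k},-\mathbf{k}'} = 1$ for $\mathbf{k} = -\mathbf{k}'$ and is otherwise equal to zero.

In the continuous limit, as $V \to \infty$, the sum in (8.1) is replaced by the integral

$$f(\mathbf{x}) = \int f_{\mathbf{k}}e^{i\mathbf{k}\mathbf{x}}\frac{d^3k}{(2\pi)^{3/2}}, \tag{8.5}$$

and (8.4) becomes

$$\langle f_{\mathbf{k}}f_{\mathbf{k}'}\rangle = \sigma_k^2\delta(\mathbf{k} + \mathbf{k}'), \tag{8.6}$$

where $\delta(\mathbf{k} + \mathbf{k}')$ is the Dirac delta function. Note that the Fourier coefficients in
(8.5) are related to the Fourier coefficients in (8.1) by a factor of $\sqrt{V}$ and have
dimension $\text{cm}^3$. In contrast to the dimensionless Kronecker symbol $\delta_{\mathbf{k},-\mathbf{k}'}$, the Dirac
delta function has dimension $\text{cm}^3$. The dimension of $\sigma_k^2$ does not change in the
transition to the continuous limit and this quantity does not acquire any extra volume
factors.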
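
The Gaussian statistics (8.3) and the correlator (8.4) can be illustrated by direct
sampling. In the sketch below each "region" is one independent draw of the pair
$(a_{\mathbf{k}}, b_{\mathbf{k}})$ for a single fixed mode; the function name, the value of
$\sigma_k^2$, the number of regions and the seed are hypothetical choices for the example.

```python
import random

def sample_correlators(sigma2=2.0, n_regions=20000, seed=0):
    """Estimate <f_k f_{-k}> and <f_k f_k> for one mode by sampling (8.3)."""
    rng = random.Random(seed)
    s = (sigma2 / 2.0) ** 0.5  # a_k and b_k each have variance sigma_k^2 / 2
    sum_opposite = 0.0         # accumulates f_k f_{-k} = a^2 + b^2
    sum_same = 0j              # accumulates f_k f_k = a^2 - b^2 + 2iab
    for _ in range(n_regions):
        a = rng.gauss(0.0, s)
        b = rng.gauss(0.0, s)
        f = complex(a, b)
        sum_opposite += (f * f.conjugate()).real  # reality: f_{-k} = f_k^*
        sum_same += f * f
    return sum_opposite / n_regions, sum_same / n_regions
```

Up to sampling noise, the first returned value should approach $\sigma_k^2$ and the
second should approach zero, in line with the Kronecker symbol in (8.4).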

Alternatively, a Gaussian random field can be characterized by the spatial
two-point correlation function

$$\xi_f(\mathbf{x} - \mathbf{y}) \equiv \langle f(\mathbf{x})f(\mathbf{y})\rangle. \tag{8.7}$$

This function tells us how large the field fluctuations are on different scales. In
the homogeneous and isotropic case, the correlation function depends only on the
distance between the points $\mathbf{x}$ and $\mathbf{y}$, that is, $\xi_f = \xi_f(|\mathbf{x} - \mathbf{y}|)$. Substituting (8.5)
into (8.7) and averaging over the ensemble with the help of (8.6), we find

$$\xi_f(|\mathbf{x} - \mathbf{y}|) = \int_0^\infty\frac{\sigma_k^2k^3}{2\pi^2}\frac{\sin(kr)}{kr}\frac{dk}{k}, \tag{8.8}$$

where $r \equiv |\mathbf{x} - \mathbf{y}|$. In deriving this relation we have taken into account isotropy
and performed the integration over angles. The dimensionless variance

$$\delta_f^2(k) \equiv \frac{\sigma_k^2k^3}{2\pi^2} \tag{8.9}$$

is roughly the typical squared amplitude of the fluctuations on scales $\lambda \sim 1/k$.


Problem 8.1 Verify that the typical fluctuations of $f$, averaged over the volume
$V \sim \lambda^3$, can be estimated as

$$\left\langle\left(\frac{1}{V}\int_Vf\,d^3x\right)^2\right\rangle^{1/2} \sim O(1)\,\delta_f(k \sim \lambda^{-1}). \tag{8.10}$$

When does this estimate fail?


Problem 8.2 Show that different variances for $a_{\mathbf{k}}$ and $b_{\mathbf{k}}$ contradict the assumption
of homogeneity and that the dependence of $\sigma_k^2$ on the direction of $\mathbf{k}$ is in conflict
with isotropy.

Thus, in the case of the Gaussian random process we need to know only $\sigma_k^2$
or, equivalently, $\delta_f^2(k)$. For small perturbations Fourier modes evolve
independently. Therefore, the spatial distribution of inhomogeneities remains Gaussian
and only their spectrum changes with time. When the perturbations enter the
nonlinear regime, different Fourier modes start to “interact.” As a result the statistical
analysis of nonlinear structure becomes very complicated.

In this chapter we consider only small inhomogeneities and our main task is
to derive the initial perturbation spectrum generated in inflation. This spectrum
will be characterized by the variance of the gravitational potential $\sigma_k^2 \equiv |\Phi_k|^2$ or,
equivalently, by the dimensionless variance

$$\delta_\Phi^2(k) \equiv \frac{|\Phi_k|^2k^3}{2\pi^2}. \tag{8.11}$$

In the following we refer to $\delta_\Phi^2(k)$ as the power spectrum. Given $\delta_\Phi$, the corresponding
power spectrum for the energy density fluctuations can easily be calculated.


8.2 Perturbations on inflation (slow-roll approximation) 


To aid our intuition we begin with a nonrigorous derivation of the inflationary
spectrum in a simple model with a scalar field. Let us consider the universe filled
by a scalar field $\varphi$ with potential $V(\varphi)$, and see how small inhomogeneities
$\delta\varphi(\mathbf{x}, \eta)$, superimposed on a homogeneous component $\varphi_0(\eta)$, evolve during the
inflationary stage. In curved spacetime the scalar field satisfies the Klein-Gordon
equation,

$$\frac{1}{\sqrt{-g}}\frac{\partial}{\partial x^\alpha}\left(\sqrt{-g}\,g^{\alpha\beta}\frac{\partial\varphi}{\partial x^\beta}\right) + \frac{\partial V}{\partial\varphi} = 0, \tag{8.12}$$

which follows immediately from the action

$$S = \int\left(\frac{1}{2}g^{\gamma\delta}\varphi_{,\gamma}\varphi_{,\delta} - V\right)\sqrt{-g}\,d^4x. \tag{8.13}$$

A small perturbation $\delta\varphi(\mathbf{x}, \eta)$ induces scalar metric perturbations and as a result
the metric takes the form (7.17). Substituting

$$\varphi = \varphi_0(\eta) + \delta\varphi(\mathbf{x}, \eta)$$

together with (7.17) into (8.12), we find that the Klein-Gordon equation for the
homogeneous component reduces to

$$\varphi_0'' + 2\mathcal{H}\varphi_0' + a^2V_{,\varphi} = 0 \tag{8.14}$$

(compare with (5.24)). To linear order in metric perturbations and $\delta\varphi$ it becomes

$$\delta\varphi'' + 2\mathcal{H}\delta\varphi' - \Delta\left(\delta\varphi - \varphi_0'(B - E')\right) + a^2V_{,\varphi\varphi}\delta\varphi - \varphi_0'(3\psi + \phi)' + 2a^2V_{,\varphi}\phi = 0. \tag{8.15}$$


This equation is valid in any coordinate system. Using the background equation
(8.14), we can easily recast it in terms of the gauge-invariant variables $\Phi$ and $\Psi$,
defined in (7.19), and the gauge-invariant scalar field perturbation

$$\overline{\delta\varphi} \equiv \delta\varphi - \varphi_0'(B - E'). \tag{8.16}$$

The result is

$$\overline{\delta\varphi}'' + 2\mathcal{H}\overline{\delta\varphi}' - \Delta\overline{\delta\varphi} + a^2V_{,\varphi\varphi}\overline{\delta\varphi} - \varphi_0'(3\Psi + \Phi)' + 2a^2V_{,\varphi}\Phi = 0. \tag{8.17}$$


Problem 8.3 Derive (8.15) and (8.17). (Hint As previously noted, the quickest
way to derive gauge-invariant equations is to use the longitudinal gauge. After the
equations are obtained in this gauge, we simply have to replace the perturbation
variables by the corresponding gauge-invariant quantities: $\phi_l \to \Phi$, $\psi_l \to \Psi$ and
$\delta\varphi_l \to \overline{\delta\varphi}$. Then using the explicit expressions for $\Phi$, $\Psi$ and $\overline{\delta\varphi}$ we can write the
equations in an arbitrary coordinate system.)


Equation (8.17) contains three unknown variables, $\overline{\delta\varphi}$, $\Phi$ and $\Psi$, and should be
supplemented by the Einstein equations. For these we need the energy-momentum
tensor for the scalar field, which follows from the action in (8.13) upon variation
with respect to the metric $g_{\alpha\beta}$:

$$T^\alpha_\beta = g^{\alpha\gamma}\varphi_{,\gamma}\varphi_{,\beta} - \left(\frac{1}{2}g^{\gamma\delta}\varphi_{,\gamma}\varphi_{,\delta} - V(\varphi)\right)\delta^\alpha_\beta. \tag{8.18}$$

It is convenient to use (7.39). For the energy-momentum tensor (8.18), the perturbed
gauge-invariant components $\overline{\delta T}^0_i$, defined in (7.35), are

$$\overline{\delta T}^0_i = \frac{1}{a^2}\left(\varphi_0'\delta\varphi - \varphi_0'^2(B - E')\right)_{,i} = \frac{1}{a^2}\left(\varphi_0'\overline{\delta\varphi}\right)_{,i}. \tag{8.19}$$

Equation (7.39) becomes

$$\Psi' + \mathcal{H}\Phi = 4\pi\varphi_0'\overline{\delta\varphi}, \tag{8.20}$$

where we have set $G = 1$. Finally, we note that the nondiagonal spatial components
of the energy-momentum tensor are equal to zero, that is, $\delta T^i_k = 0$ for $i \neq k$, and
hence $\Psi = \Phi$.

We will solve (8.17) and (8.20) in two limiting cases: for perturbations with a
physical wavelength $\lambda_{ph}$ much smaller than the curvature scale $H^{-1}$ and for
long-wavelength perturbations with $\lambda_{ph} \gg H^{-1}$. The curvature scale does not change
very much during inflation. On the other hand the physical scale of a perturbation,
$\lambda_{ph} \sim a/k$, grows. For the modes we will be interested in, the physical wavelength
starts smaller than the Hubble radius but eventually exceeds it.

Our strategy will be the following. We start with a short-wavelength perturbation 
and fix its amplitude at the minimal possible level allowed by the uncertainty 
principle (vacuum fluctuations). We then study how the perturbation evolves after 
it crosses the Hubble radius. 

We occasionally adopt the widespread custom of referring to the curvature 
(Hubble) radius as the (event) horizon scale. To avoid confusion the reader should 
be sure to distinguish the curvature radius from the particle horizon scale, which 
grows exponentially during inflation. What is relevant for the dynamics of the pertur- 
bations is the curvature scale, not the particle horizon size which has a kinematical 
origin. 


8.2.1 Inside the Hubble scale 


The gravitational field is not crucial to the evolution of the short-wavelength
perturbations with $\lambda_{ph} \ll H^{-1}$ or, equivalently, with $k \gg Ha \sim |\eta|^{-1}$. In fact, for very
large $k|\eta|$ the spatial derivative term dominates in (8.17) and its solution behaves as
$\exp(\pm ik\eta)$ to leading order. The gravitational field also oscillates, so that $\Phi' \sim k\Phi$,
and can be estimated from (8.20) as $\Phi \sim k^{-1}\varphi_0'\overline{\delta\varphi}$. Using this estimate and taking
into account that during inflation $V_{,\varphi\varphi} \ll V \sim H^2$, we find that only the first three
terms in (8.17) are relevant. Thus, for a plane wave perturbation with comoving
wavenumber $k$, this equation reduces to

$$\overline{\delta\varphi}_k'' + 2\mathcal{H}\overline{\delta\varphi}_k' + k^2\overline{\delta\varphi}_k \simeq 0, \tag{8.21}$$

and with the substitution $\overline{\delta\varphi}_k = u_k/a$, it becomes

$$u_k'' + \left(k^2 - \frac{a''}{a}\right)u_k = 0. \tag{8.22}$$

For $k|\eta| \gg 1$ the last term in (8.22) can be neglected and the resulting solution for
$\overline{\delta\varphi}_k$ is

$$\overline{\delta\varphi}_k \simeq \frac{C_k}{a}\exp(\pm ik\eta), \tag{8.23}$$

where $C_k$ is a constant of integration which has to be fixed by the initial conditions.
The physical ingredient is that the initial scalar field modes arise as vacuum quantum
fluctuations.


Quantum fluctuations To make a rough estimate for the typical amplitude of the
vacuum quantum fluctuations $\delta\varphi_L$ on physical scales $L$, we consider a finite volume
$V \sim L^3$. Assuming that the field is nearly homogeneous within this volume, we
can write its action (see (8.13)) in the form

$$S \simeq \frac{1}{2}\int\left(\dot{X}^2 + \cdots\right)dt,$$

where $X \equiv \delta\varphi_LL^{3/2}$ and the dot denotes the derivative with respect to the physical
time $t$. It is clear that $X$ plays the role of the canonical quantization variable and
the corresponding conjugate momentum is $P = \dot{X} \sim X/L$; in the latter estimate
we have assumed that the mass of the field is negligible and hence the field
propagates with the speed of light. The variables $X$ and $P$ satisfy the uncertainty relation
$\Delta X\Delta P \sim 1$ ($\hbar = 1$) and it follows that the minimal amplitude of the quantum
fluctuations is $X_m \sim \sqrt{L}$ or $\delta\varphi_L \sim L^{-1}$. Thus the amplitude of the minimal fluctuations
of a massless scalar field is inversely proportional to the physical scale. Taking into
account that $\delta\varphi_L \sim |\delta\varphi_k|k^{3/2}$, where $k \sim a/L$ is the comoving wavenumber, we
obtain

$$|\delta\varphi_k| \sim \frac{k^{-1/2}}{a}. \tag{8.24}$$

Comparing this result with (8.23), we infer that $|C_k| \sim k^{-1/2}$. The evolution of the
mode according to (8.23) preserves the vacuum spectrum.

The result obtained is not surprising and has a simple physical interpretation. On 
scales smaller than the curvature scale one can always use the local inertial frame 
in which spacetime can be well approximated by the Minkowski metric. Therefore 
the short-wavelength fluctuations “think” they are in Minkowski space and the vac- 
uum is preserved. Above, we have simply described this vacuum in an expanding 
coordinate system, where the perturbations with given comoving wavenumbers are 
continuously being stretched by the expansion. As a result, for a given physical 
scale, they are replaced by perturbations which were initially on sub-Planckian 
scales. This does not mean, however, that for a consistent treatment of quantum 
perturbations we need nonperturbative quantum gravity. Given a physical scale, 
which is larger than the Planckian length, vacuum fluctuations with amplitudes 
given above will always be present, irrespective of whether they are formally de- 
scribed as “being stretched from sub-Planckian scales” in the expanding coordinate 
system or as always existing at the given scale in the nonexpanding local inertial 
frame. 

We noted in Chapter 5 that inflation “washes out” all pre-existing classical
inhomogeneities by stretching them to very large scales. It is sometimes said that
inflation removes the “classical hairs.” However, it cannot remove quantum
fluctuations (“quantum hairs”). In place of the stretched quantum fluctuations, new ones
“are generated” via the Heisenberg uncertainty relation. For a given comoving
wavenumber $k$, the typical amplitude of fluctuations is of order

$$\delta_\varphi(k) \sim |\delta\varphi_k|k^{3/2} \sim \frac{k}{a_k} \sim H_{k \sim Ha} \tag{8.25}$$

at the moment of the horizon crossing, $Ha_k \sim k$ (or $k\eta_k \sim 1$). During the
inflationary stage $Ha = \dot{a}$ increases and a perturbation with a given $k$ eventually leaves the
horizon. To see whether it will remain large enough after being stretched to galactic
scales, we have to find out how it will behave on supercurvature scales.


8.2.2 The spectrum of generated perturbations 


To determine the behavior of long-wavelength perturbations we use the slow-roll
approximation. In Chapter 5 we saw that for the homogeneous mode this
approximation means that in the equation

$$\ddot\varphi_0 + 3H\dot\varphi_0 + V_{,\varphi} = 0 \tag{8.26}$$

we can neglect the second derivative with respect to the physical time $t$ and it
simplifies to

$$3H\dot\varphi_0 + V_{,\varphi} = 0. \tag{8.27}$$

To take advantage of the slow-roll approximation for the perturbations, we have to
recast (8.17) and (8.20) in terms of the physical time $t$:

$$\ddot{\delta\varphi} + 3H\dot{\delta\varphi} - \Delta\delta\varphi + V_{,\varphi\varphi}\delta\varphi - 4\dot\varphi_0\dot\Phi + 2V_{,\varphi}\Phi = 0, \tag{8.28}$$

$$\dot\Phi + H\Phi = 4\pi\dot\varphi_0\delta\varphi, \tag{8.29}$$

where $\delta\varphi \equiv \overline{\delta\varphi}$ and we have taken into account that $\Psi = \Phi$. First of all we note
that the spatial derivative term $\Delta\delta\varphi$ can be neglected for long-wavelength
inhomogeneities. To find the nondecaying slow-roll mode we next omit terms proportional
to $\ddot{\delta\varphi}$ and $\dot\Phi$. (After finding the solution of the simplified equations one can check
that the omitted terms are actually negligible.) The equations for the perturbations
become

$$3H\dot{\delta\varphi} + V_{,\varphi\varphi}\delta\varphi + 2V_{,\varphi}\Phi \simeq 0, \qquad H\Phi \simeq 4\pi\dot\varphi_0\delta\varphi. \tag{8.30}$$

Introducing the new variable

$$y \equiv \delta\varphi/V_{,\varphi}$$

and using (8.27), they further simplify to

$$3H\dot{y} + 2\Phi = 0, \qquad H\Phi = 4\pi\dot{V}y. \tag{8.31}$$

Since $3H^2 \simeq 8\pi V$ during inflation, we obtain the equation

$$\frac{d(yV)}{dt} = 0, \tag{8.32}$$

which is readily integrated to give

$$y = A/V, \tag{8.33}$$

where $A$ is a constant of integration. The final result for the nondecaying mode is

$$\delta\varphi_k = A_k\frac{V_{,\varphi}}{V}, \qquad \Phi_k = 4\pi A_k\frac{\dot\varphi_0}{H}\frac{V_{,\varphi}}{V} = -\frac{1}{2}A_k\left(\frac{V_{,\varphi}}{V}\right)^2. \tag{8.34}$$
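
The equality of the two expressions for $\Phi_k$ in (8.34) relies only on the slow-roll
relations $3H\dot\varphi_0 + V_{,\varphi} = 0$ and $3H^2 = 8\pi V$. A short numerical sketch makes this
explicit for the quadratic potential $V = m^2\varphi^2/2$ in units $G = 1$; the function
name and the sample values of `phi0`, `m` and `A` are assumptions chosen only for
illustration.

```python
import math

def phi_two_ways(phi0, m=1.0e-6, A=1.0):
    """Evaluate both forms of Phi_k in (8.34) for V = m^2 phi^2 / 2 (G = 1)."""
    V = 0.5 * m ** 2 * phi0 ** 2
    dV = m ** 2 * phi0                       # V_{,phi}
    H = math.sqrt(8.0 * math.pi * V / 3.0)   # 3 H^2 = 8 pi V
    phi0_dot = -dV / (3.0 * H)               # slow-roll equation (8.27)
    first = 4.0 * math.pi * A * (phi0_dot / H) * (dV / V)
    second = -0.5 * A * (dV / V) ** 2
    return first, second
```

For this potential $V_{,\varphi}/V = 2/\varphi_0$, so both expressions should reduce to
$-2A/\varphi_0^2$ up to rounding errors.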


The behavior of $\delta\varphi_k(a)$ is shown in Figure 8.1. For $a < a_k \sim k/H$ the perturbation
is still inside the horizon and its amplitude decreases in inverse proportion to the
scale factor. After horizon crossing, for $a > a_k$, the perturbation amplitude slowly
increases since $V_{,\varphi}/V$ grows towards the end of inflation. In particular, for a
power-law potential, $V \propto \varphi^n$, we have $\delta\varphi_k \propto \varphi^{-1}$. The integration constant $A_k$ in (8.34)
can be fixed by requiring that $\delta\varphi_k$ has the minimal vacuum amplitude at the moment
of horizon crossing. Comparing (8.34) to (8.25), we find

$$A_k \sim \frac{k^{-1/2}}{a_k}\left(\frac{V}{V_{,\varphi}}\right)_{k \sim Ha},$$

where the index $k \sim Ha$ means that the corresponding quantity is estimated at the
moment of horizon crossing. At the end of inflation ($t \sim t_f$), the slow-roll condition
is violated and $V_{,\varphi}/V$ becomes of order unity. Therefore, it follows from (8.34)
that at this time the typical amplitude of the metric fluctuations on supercurvature
scales is

$$\delta_\Phi(k, t_f) \sim A_kk^{3/2} \sim \left(H\frac{V}{V_{,\varphi}}\right)_{k \sim Ha} \sim \left(\frac{V^{3/2}}{V_{,\varphi}}\right)_{k \sim Ha}. \tag{8.35}$$

Fig. 8.1.


In particular, for the power-law potential $V = \lambda\varphi^n/n$ we obtain

$$\delta_\Phi(k, t_f) \sim \lambda^{1/2}\left(\varphi^2_{k \sim Ha}\right)^{\frac{n+2}{4}} \sim \lambda^{1/2}\left(\ln\lambda_{ph}H_k\right)^{\frac{n+2}{4}}, \tag{8.36}$$

where (5.53) has been used with $k \sim a_kH_k$ to express $\varphi^2_{k \sim Ha}$ in terms of the
physical wavelength $\lambda_{ph} \sim a(t_f)k^{-1}$. The spectrum (8.36) is shown in Figure 8.2.
The effect of $H_k$ in the logarithm in (8.36) is not very significant; we make only a
slight error by taking $H_k \sim H(t_f)$.

For a massive scalar field $V = m^2\varphi^2/2$ and the amplitude of the metric
fluctuations is

$$\delta_\Phi \sim m\ln(\lambda_{ph}H_k). \tag{8.37}$$


We will show in the next section that perturbations present at the end of inflation
survive the subsequent reheating phase practically unchanged. Since galactic scales
correspond to $\ln(\lambda_{ph}H_k) \sim 50$ and we require the amplitude of the gravitational
potential to be $O(1) \times 10^{-5}$, the mass of the scalar field should be about $10^{-6}$
in Planck units or $m \sim 10^{13}$ GeV. This determines the energy scale at the end of
inflation to be $\varepsilon \sim m^2 \sim 10^{-12}\varepsilon_{Pl}$. In the absence of a fundamental particle
theory we cannot predict the amplitude of the perturbations; it is a free parameter
of the theory. However, the shape of the spectrum is predicted: it has
logarithmic deviations from a flat spectrum with the amplitude growing slightly towards
larger scales. We will see that this is a rather generic and robust prediction of
inflation.

Fig. 8.2.

Problem 8.4 Show that the scalar field perturbations in the synchronous gauge are
given by

$$\delta\varphi_s = \overline{\delta\varphi} - \dot\varphi_0\int\Phi\,dt \tag{8.38}$$


(compare with (7.30)). Substituting (8.34) into (8.38), verify that $\delta\varphi_s = C_1\dot\varphi_0$,
where $C_1$ is the integration constant in (8.38) corresponding to the purely coordinate
mode. It is easy to understand why this mode is fictitious by simply considering the
homogeneous field $\varphi_0(\eta)$ and making the coordinate transformation (7.27) which
preserves the synchronous gauge. As we will see in Problem 8.8, the
long-wavelength physical perturbations of the scalar fields are suppressed in the synchronous
gauge by a factor $(k\eta)^2$.


Problem 8.5 It is clear that large-scale metric fluctuations after inflation can depend
only on the few parameters characterizing them during the inflationary stage. Give
arguments for why the most natural candidates for these parameters are $\delta\varphi$, $\dot\varphi_0$ and
$H$. Out of these parameters, build the reasonable dimensionless combination which
could describe metric fluctuations after inflation. Substituting for $\delta\varphi$ the amplitude
of the quantum fluctuations, compare the estimate obtained with (8.35). Which
questions remain open in this dimensional-based approach?


Problem 8.6 Consider two slow-roll fields $\varphi_1$ and $\varphi_2$ with potential
$V(\varphi_1, \varphi_2) = V_1(\varphi_1) + V_2(\varphi_2)$ and verify that the nondecaying mode of the
long-wavelength perturbations is given by

$$\Phi = A\frac{\dot{H}}{H^2} + B\frac{1}{H}\frac{V_1\dot{V}_2 - \dot{V}_1V_2}{V_1 + V_2}. \tag{8.39}$$

The first term on the right hand side here is similar to (8.34) and it can be interpreted
as the adiabatic mode. The second term describes the entropic contribution which is
present when we have more than one field. When two or more scalar fields play an
important role during inflation we can get a variety of different spectra and inflation,
to a large extent, loses its predictive power. Therefore, we will not consider this case
any further. (Hint Introduce the new variables $y_1 \equiv \delta\varphi_1/V_{1,\varphi}$ and $y_2 \equiv \delta\varphi_2/V_{2,\varphi}$.)


8.2.3 Why do we need inflation? 


It is natural to ask whether quantum metric fluctuations can be substantially am- 
plified in an expanding universe without an inflationary stage. Let us explain why 
this is impossible. 

Quantum metric fluctuations are large only near the Planckian scale. For example,
in Minkowski space the typical amplitude of vacuum metric fluctuations
corresponding to gravitational waves can be estimated on dimensional grounds
as $h \sim l_{Pl}/L$, where $l_{Pl} \sim 10^{-33}$ cm. It is incredibly small: on galactic scales
$L \sim 10^{25}$ cm, so $h \sim 10^{-58}$. Scalar metric perturbations due to vacuum fluctuations
of the scalar field are even smaller. Thus the only way to get the required
amplitude $\Phi \sim 10^{-5}$ on large scales from initial quantum fluctuations is by stretching
the very short-wavelength fluctuations. During this stretching, the mode must
not lose its amplitude. Let us consider a scalar field perturbation, which determines
the metric fluctuations, and find out what generally happens to its amplitude when
the spatial size of the perturbation is stretched. As we have seen, the amplitude
decays in inverse proportion to the spatial size until the perturbation starts to “feel”
the curvature of the universe. This happens when its size begins to exceed the
curvature scale $H^{-1}$. Therefore, if during expansion the perturbation size always
remains smaller than the curvature scale, then its amplitude continuously decreases;
it “arrives” at large scales with negligible vacuum amplitude. In a decelerating
universe, the curvature scale $H^{-1} = a/\dot a$ grows faster than the physical wavelength
of the perturbation ($\lambda_{ph} \propto a$) because $\dot a$ is decreasing (see Figure 8.3). Hence, if
a perturbation is initially inside the horizon, it remains there and decays. Perturbations
on those scales which were initially a little larger than the Hubble radius will
soon enter the horizon and also decay. Thus, in a decelerating expanding universe,
quantum fluctuations can never be significantly amplified to become relevant for
large-scale structure.
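The two estimates used in this argument can be checked with a few lines of Python. The sketch below is only illustrative: the function names are hypothetical, and a power-law expansion $a \propto t^p$ is assumed as a stand-in for a decelerating ($p < 1$) or accelerating ($p > 1$) universe, for which $\lambda_{ph} H \propto t^{p-1}$.

```python
# Illustrative check of the estimates above (all names are hypothetical).
L_PLANCK_CM = 1e-33    # Planck length quoted in the text
L_GALACTIC_CM = 1e25   # galactic scale quoted in the text


def vacuum_amplitude(length_cm, l_planck_cm=L_PLANCK_CM):
    """Dimensional estimate h ~ l_Pl / L for vacuum metric fluctuations."""
    return l_planck_cm / length_cm


def wavelength_over_curvature_scale(t, p, t0=1.0):
    """lambda_ph / H^{-1} for a ∝ t^p, normalised to 1 at t = t0.

    With a ∝ t^p we have H = p / t, so lambda_ph * H ∝ t^(p - 1).
    """
    return (t / t0) ** (p - 1.0)


h_galactic = vacuum_amplitude(L_GALACTIC_CM)
ratio_decelerating = wavelength_over_curvature_scale(100.0, p=0.5)
ratio_accelerating = wavelength_over_curvature_scale(100.0, p=2.0)

print(f"h on galactic scales ~ {h_galactic:.0e}")
print(f"p = 1/2: lambda_ph * H at t = 100 t0 -> {ratio_decelerating:.2f}")
print(f"p = 2:   lambda_ph * H at t = 100 t0 -> {ratio_accelerating:.0f}")
```

For $p < 1$ the ratio falls below unity, so a mode that starts at the curvature scale moves inside it; for $p > 1$ the ratio grows and the mode is carried outside.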





[Fig. 8.3: the curvature scale $H^{-1}$ (inflation) plotted against the scale factor $a$.]

In a universe which undergoes a stage of accelerated expansion, the Hubble
scale $H^{-1} = a/\dot a$ grows more slowly than the scale factor $a$ because the rate of the
expansion $\dot a$ increases. Hence a perturbation which was initially inside the horizon
soon leaves it (see Figure 8.3) and starts to “feel” the curvature effects which
preserve its amplitude from decay. The amplitude even grows slightly. We will see
later that this growth of the amplitude is a rather general property of inflationary
scenarios and it results in the deviation of the spectrum from a flat one. Thus, the
initial amplitude of the subcurvature perturbation decays only until the moment
of horizon crossing. After that it freezes out and the perturbation is stretched to
galactic scales with nearly unchanged amplitude. Because the curvature scale does
not change significantly during inflation, the freeze-out amplitude is nearly the
same for different scales and this leads to a nearly flat spectrum for the produced
inhomogeneities.

The initial quantum fluctuations are Gaussian. Subsequent evolution influences 
only their power spectrum and preserves the statistical properties of the fluctuations. 
As a consequence, simple inflation predicts Gaussian adiabatic perturbations. 


8.3 Quantum cosmological perturbations 


In this section we develop a consistent quantum theory of cosmological perturbations.
We consider a flat universe filled by a scalar field condensate described by
the action

$$S = \int p(X, \varphi)\,\sqrt{-g}\,d^4x, \tag{8.40}$$

where

$$X \equiv \frac{1}{2}\, g^{\alpha\beta}\,\partial_\alpha\varphi\,\partial_\beta\varphi. \tag{8.41}$$

The Lagrangian $p(X, \varphi)$ plays the role of pressure. Indeed, by varying action (8.40)
with respect to the metric, we obtain the energy-momentum tensor in the form of
an ideal hydrodynamical fluid (see Problem 5.17):

$$T^\alpha_\beta = (\varepsilon + p)\,u^\alpha u_\beta - p\,\delta^\alpha_\beta. \tag{8.42}$$

Here $u_\nu \equiv \partial_\nu\varphi/\sqrt{2X}$ and the energy density $\varepsilon$ is given by the expression

$$\varepsilon \equiv 2X p_{,X} - p, \tag{8.43}$$

where $p_{,X} \equiv \partial p/\partial X$. Thus, a scalar field can be used to describe potential flow of an
ideal fluid. Conversely, hydrodynamics provides a useful analogy for a scalar field
with an arbitrary Lagrangian. Action (8.40) is enough to describe all single-field
inflationary scenarios, including k inflation. If $p$ depends only on $X$, then $\varepsilon = \varepsilon(X)$,
and in many cases (8.43) can be rearranged to give $p = p(\varepsilon)$, the equation of state
for an isentropic fluid. For $p \propto X^n$ we have $p = \varepsilon/(2n - 1)$, so, for example, the
Lagrangian $p \propto X^2$ describes an “ultra-relativistic fluid” with equation of state
$p = \varepsilon/3$. In the general case, $p = p(X, \varphi)$, the pressure cannot be expressed only
in terms of $\varepsilon$ since $X$ and $\varphi$ are independent. However, even in this case, the
hydrodynamical analogy is still useful. For a canonical scalar field we have
$p = X - V(\varphi)$ and, correspondingly, $\varepsilon = X + V$.
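The relation between the Lagrangian and the equation of state can be verified numerically. The following minimal sketch (function names are illustrative, and the derivative $p_{,X}$ is approximated by a central finite difference) evaluates $\varepsilon = 2Xp_{,X} - p$ for a power-law Lagrangian and for a canonical field with an arbitrarily chosen constant value of the potential.

```python
# Sketch: energy density from the Lagrangian p(X) via (8.43); names are hypothetical.

def energy_density(p, X, rel_step=1e-6):
    """epsilon = 2 X p_X - p, with p_X from a central finite difference."""
    dX = X * rel_step
    p_X = (p(X + dX) - p(X - dX)) / (2.0 * dX)
    return 2.0 * X * p_X - p(X)


def power_law_pressure(n):
    """Lagrangian p = X^n (overall constant set to 1)."""
    return lambda X: X ** n


def equation_of_state(n, X=0.7):
    """Ratio p / epsilon for p ∝ X^n; the text gives 1 / (2n - 1)."""
    p = power_law_pressure(n)
    return p(X) / energy_density(p, X)


w_n2 = equation_of_state(2)   # ultra-relativistic fluid
w_n3 = equation_of_state(3)

# Canonical field: p = X - V with V held fixed (illustrative value V = 0.3)
V = 0.3
eps_canonical = energy_density(lambda X: X - V, 0.7)

print(f"n = 2: p/eps = {w_n2:.6f}")
print(f"n = 3: p/eps = {w_n3:.6f}")
print(f"canonical: eps = {eps_canonical:.6f}")
```

The ratio is independent of the value of $X$ chosen, as expected for an isentropic fluid.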


8.3.1 Equations 


Here we derive the equations for perturbations and recast them in a simple, convenient
form. The reader interested only in the final result can go directly to
(8.56)-(8.58).


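Background The state of a flat, homogeneous universe is characterized completely
by the scale factor $a(\eta)$ and the homogeneous field $\varphi_0(\eta)$, which satisfy the familiar
equations

$$\mathcal{H}^2 = \frac{8\pi}{3}\,a^2\varepsilon, \tag{8.44}$$

and

$$\varepsilon' = \varepsilon_{,X}X_0' + \varepsilon_{,\varphi}\varphi_0' = -3\mathcal{H}(\varepsilon + p), \tag{8.45}$$

where $X_0 = \varphi_0'^2/(2a^2)$ and we have set $G = 1$. Substituting $\varepsilon$ from (8.44) into the
left hand side of (8.45), we obtain the relation

$$\mathcal{H}' - \mathcal{H}^2 = -4\pi a^2(\varepsilon + p), \tag{8.46}$$

which is useful in what follows.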


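Perturbations To derive the equations for inhomogeneities we must first express
the gauge-invariant perturbations of the energy-momentum tensor $\overline{\delta T}{}^\alpha_\beta$ in terms of
the scalar field and metric perturbations. The calculation can easily be done in the
longitudinal gauge, where the metric takes the form

$$ds^2 = a^2(\eta)\left[(1 + 2\Phi)\,d\eta^2 - (1 - 2\Psi)\,\delta_{ik}\,dx^i dx^k\right]. \tag{8.47}$$

To linear order in perturbation we have

$$\delta X = \frac{1}{2}\,\delta g^{00}\varphi_0'^2 + g^{00}\varphi_0'\,\delta\varphi' = 2X_0\left(-\Phi + \frac{\delta\varphi'}{\varphi_0'}\right) \tag{8.48}$$

and the $\delta T^0_0$ component is

$$\delta T^0_0 = \delta\varepsilon = \varepsilon_{,X}\,\delta X + \varepsilon_{,\varphi}\,\delta\varphi
= \varepsilon_{,X}\left(\delta X - X_0'\,\frac{\delta\varphi}{\varphi_0'}\right) - 3\mathcal{H}(\varepsilon + p)\,\frac{\delta\varphi}{\varphi_0'}$$
$$= \frac{\varepsilon + p}{c_s^2}\left(\left(\frac{\delta\varphi}{\varphi_0'}\right)' + \mathcal{H}\,\frac{\delta\varphi}{\varphi_0'} - \Phi\right) - 3\mathcal{H}(\varepsilon + p)\,\frac{\delta\varphi}{\varphi_0'}. \tag{8.49}$$

We have used here the second equality in (8.45) to express $\varepsilon_{,\varphi}$ in terms of $\varepsilon_{,X}$, $\varepsilon$
and $p$, and introduced the “speed of sound”

$$c_s^2 \equiv \frac{p_{,X}}{\varepsilon_{,X}} = \frac{\varepsilon + p}{2X\varepsilon_{,X}}. \tag{8.50}$$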





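For a canonical scalar field the “speed of sound” is always equal to the speed of
light, $c_s = 1$. The components $\delta T^0_i$ are readily calculated and the result is

$$\delta T^0_i = (\varepsilon + p)\,u^0\,\delta u_i = (\varepsilon + p)\,g^{00}\,\frac{\varphi_0'}{\sqrt{2X_0}}\,\frac{\partial_i\delta\varphi}{\sqrt{2X_0}} = (\varepsilon + p)\left(\frac{\delta\varphi}{\varphi_0'}\right)_{,i}. \tag{8.51}$$

Replacing $\delta\varphi$ by $\overline{\delta\varphi}$, defined in (8.16), and substituting (8.49) and (8.51) into (7.38)
and (7.39), one obtains the equations for the gauge-invariant variables $\Psi$, $\Phi$ and $\overline{\delta\varphi}$:

$$\Delta\Psi - 3\mathcal{H}(\Psi' + \mathcal{H}\Phi) = 4\pi a^2(\varepsilon + p)\left[\frac{1}{c_s^2}\left(\left(\frac{\overline{\delta\varphi}}{\varphi_0'}\right)' + \mathcal{H}\,\frac{\overline{\delta\varphi}}{\varphi_0'} - \Phi\right) - 3\mathcal{H}\,\frac{\overline{\delta\varphi}}{\varphi_0'}\right], \tag{8.52}$$

$$(\Psi' + \mathcal{H}\Phi) = 4\pi a^2(\varepsilon + p)\left(\frac{\overline{\delta\varphi}}{\varphi_0'}\right). \tag{8.53}$$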

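Since $\delta T^i_k = 0$ for $i \neq k$, we have $\Psi = \Phi$; the two equations above are sufficient
to determine the gravitational potential and the perturbation of the scalar field. It is
useful, however, to recast them in a slightly different, more convenient form. Using
(8.53) to express $\Phi$ in terms of $\Psi'$ and $\overline{\delta\varphi}$ and substituting the result into (8.52),
we obtain

$$\Delta\Psi = \frac{4\pi a^2(\varepsilon + p)}{c_s^2\,\mathcal{H}}\left(\mathcal{H}\,\frac{\overline{\delta\varphi}}{\varphi_0'} + \Psi\right)', \tag{8.54}$$

where the background equations (8.44) and (8.46) have also been used. Because
$\Phi = \Psi$, (8.53) can be rewritten as

$$\left(\frac{a^2\Psi}{\mathcal{H}}\right)' = \frac{4\pi a^4(\varepsilon + p)}{\mathcal{H}^2}\left(\mathcal{H}\,\frac{\overline{\delta\varphi}}{\varphi_0'} + \Psi\right). \tag{8.55}$$

Finally, in terms of the new variables

$$u \equiv \frac{\Psi}{4\pi(\varepsilon + p)^{1/2}}, \qquad v \equiv \sqrt{\varepsilon_{,X}}\;a\left(\overline{\delta\varphi} + \frac{\varphi_0'}{\mathcal{H}}\,\Psi\right), \tag{8.56}$$

(8.54) and (8.55) become

$$c_s\,\Delta u = z\left(\frac{v}{z}\right)', \qquad c_s\,v = \theta\left(\frac{u}{\theta}\right)', \tag{8.57}$$

where

$$z \equiv \frac{a^2(\varepsilon + p)^{1/2}}{c_s\,\mathcal{H}}, \qquad \theta \equiv \frac{1}{c_s z} = \sqrt{\frac{8\pi}{3}}\;\frac{1}{a}\left(1 + \frac{p}{\varepsilon}\right)^{-1/2}. \tag{8.58}$$


8.3.2 Classical solutions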

8.3.2 Classical solutions 


Substituting $v$ from the second equation in (8.57) into the first gives a closed form,
second order differential equation for $u$:

$$u'' - c_s^2\,\Delta u - \frac{\theta''}{\theta}\,u = 0. \tag{8.59}$$

The variables $u$ and $\theta$ coincide (up to irrelevant numerical factors) with the
corresponding quantities defined in (7.63) and (7.66) for the hydrodynamical fluid.
However, now they describe the perturbations in the homogeneous scalar condensate.

The solutions of (8.59) were discussed in the previous chapter. Considering a
short-wavelength plane wave perturbation with a wavenumber $k$ ($c_s^2k^2 \gg |\theta''/\theta|$),
we obtain in the WKB approximation

$$u \simeq \frac{C}{\sqrt{c_s}}\,\exp\left(\pm ik\int c_s\,d\eta\right), \tag{8.60}$$

where $C$ is a constant of integration. The long-wavelength solution, valid for
$c_s^2k^2 \ll |\theta''/\theta|$, is

$$u = C_1\theta + C_2\theta\int_{\eta_0}\frac{d\eta}{\theta^2} + O\left((k\eta)^2\right). \tag{8.61}$$
Given $u$, the gravitational potential can be inferred from the definition in (8.56):

$$\Phi = \Psi = 4\pi(\varepsilon + p)^{1/2}\,u \tag{8.62}$$

and a perturbation of the scalar field is calculated using (8.53):

$$\overline{\delta\varphi} = \dot\varphi_0\,\frac{(a\Phi)^{\cdot}}{4\pi a(\varepsilon + p)} = \dot\varphi_0\,\frac{(\dot\Phi + H\Phi)}{4\pi(\varepsilon + p)}. \tag{8.63}$$

Taking into account that

$$\varepsilon + p = 2Xp_{,X} = \dot\varphi_0^2\,p_{,X} \tag{8.64}$$

and substituting (8.60) into (8.62) and (8.63), we have

$$\Phi \simeq 4\pi C\dot\varphi_0\sqrt{\frac{p_{,X}}{c_s}}\;\exp\left(\pm ik\int\frac{c_s}{a}\,dt\right), \tag{8.65}$$

$$\overline{\delta\varphi} \simeq C\sqrt{\frac{1}{c_s\,p_{,X}}}\left(\pm ic_s\frac{k}{a} + H + \cdots\right)\exp\left(\pm ik\int\frac{c_s}{a}\,dt\right) \tag{8.66}$$

for a short-wavelength perturbation.

In the long-wavelength limit the calculation is identical to that done in deriving
(7.69), and the result is

$$\Phi \simeq A\,\frac{d}{dt}\left(\frac{1}{a}\int a\,dt\right) = A\left(1 - \frac{H}{a}\int a\,dt\right), \tag{8.67}$$

$$\overline{\delta\varphi} \simeq A\dot\varphi_0\left(\frac{1}{a}\int a\,dt\right), \tag{8.68}$$

where $A$ is a constant of integration. (A second constant of integration corresponding
to the decaying mode can always be shifted to the lower limit of integration.)

Let us first find out how a perturbation behaves during inflation. It follows from
(8.65) and (8.66) that in the short-wavelength regime both metric and scalar field
perturbations oscillate. The amplitude of the metric perturbation is proportional to
$\dot\varphi_0$ and it grows only slightly towards the end of inflation, while the amplitude of
scalar field perturbation decays in inverse proportion to the scale factor. After a
perturbation enters the long-wavelength regime it is described by (8.67) and (8.68).
These formulae are simplified during slow-roll. Integrating by parts, we obtain the
following asymptotic expansion:

$$\frac{1}{a}\int a\,dt = \frac{1}{a}\int\frac{da}{H} = \frac{1}{H} - \frac{1}{a}\int\frac{da}{H}\left(\frac{1}{H}\right)^{\cdot}$$
$$= H^{-1}\left[1 - (H^{-1})^{\cdot} + \left(H^{-1}(H^{-1})^{\cdot}\right)^{\cdot} - \cdots\right] + \frac{B}{a}, \tag{8.69}$$

where $B$ is a constant of integration corresponding to the decaying mode. Neglecting
this mode we find that to leading order

$$\Phi \simeq A\,(H^{-1})^{\cdot} = -A\,\frac{\dot H}{H^2}, \qquad \overline{\delta\varphi} \simeq A\,\frac{\dot\varphi_0}{H}. \tag{8.70}$$

It is easy to see that for standard slow-roll inflation these formulae are in agreement
with (8.34). Result (8.70) is applicable only during inflation. After the slow-roll
stage is over we must use (8.67) and (8.68) directly. Inflation is usually followed
by an oscillatory stage where the scale factor grows as some power of time, $a \propto t^p$,
with $p$ depending on the scalar field potential. We have found that for the quadratic
potential $p = 2/3$ and for the quartic potential $p = 1/2$. Neglecting the decaying
mode we obtain from (8.67) and (8.68)

$$\Phi \simeq \frac{A}{p + 1}, \qquad \overline{\delta\varphi} \simeq \frac{A\dot\varphi_0\,t}{p + 1}, \tag{8.71}$$

that is, the amplitude of the gravitational potential freezes out after inflation.

The scalar field finally converts its energy into ultra-relativistic matter corresponding
to $p = 1/2$. This influences the perturbations only via the change of the
effective equation of state and the resulting amplitude is

$$\Phi \simeq \frac{2}{3}\,A. \tag{8.72}$$
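The post-inflationary amplitudes follow from inserting $a \propto t^p$ into (8.67). As a sanity check, the sketch below evaluates $\Phi/A = \frac{d}{dt}\left(\frac{1}{a}\int a\,dt\right)$ numerically; the function names are illustrative, the lower limit of integration is set to zero (which amounts to dropping the decaying mode), and a simple trapezoidal rule with a central difference is assumed to be accurate enough for this purpose.

```python
# Numerical check of Phi / A = d/dt[(1/a) ∫ a dt] for a ∝ t^p (names are illustrative).

def integral_of_scale_factor(t, p, steps=20000):
    """Trapezoidal estimate of the integral of a(t') = t'^p from 0 to t (p > 0)."""
    h = t / steps
    total = 0.5 * (0.0 + t ** p)  # a(0) = 0 for p > 0
    for i in range(1, steps):
        total += (i * h) ** p
    return total * h


def potential_over_A(t, p, dt=1e-3):
    """Central-difference derivative of (1/a) ∫ a dt, i.e. Phi / A from (8.67)."""
    def f(s):
        return integral_of_scale_factor(s, p) / s ** p
    return (f(t + dt) - f(t - dt)) / (2.0 * dt)


phi_quadratic = potential_over_A(1.0, 2.0 / 3.0)  # oscillations in a quadratic potential
phi_radiation = potential_over_A(1.0, 0.5)        # quartic potential or radiation

print(f"p = 2/3: Phi/A = {phi_quadratic:.4f}  (1/(p+1) = {1 / (1 + 2 / 3):.4f})")
print(f"p = 1/2: Phi/A = {phi_radiation:.4f}  (1/(p+1) = {1 / 1.5:.4f})")
```

Both values agree with $1/(p+1)$ from (8.71); the second reproduces the factor $2/3$ in (8.72).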


Using (8.70), we can express $A$ in terms of $\overline{\delta\varphi}$, $\dot\varphi_0$ and $H$ at the moment of sound
horizon crossing, when $c_sk \sim Ha$. For those perturbations which leave the horizon
during inflation the final result is

$$\Phi \simeq \frac{2}{3}\left(H\,\frac{\overline{\delta\varphi}}{\dot\varphi_0}\right)_{c_sk \sim Ha}. \tag{8.73}$$

Given initial quantum fluctuations, this is consistent with the estimate in (8.35) and
we infer that the amplitude of the perturbation in the radiation-dominated epoch
differs from its amplitude at the end of inflation only by a numerical factor of order
unity. Note that (8.73) can also be applied to calculate the perturbations in theories
with a non-minimal kinetic term.


Problem 8.7 Using the integral representation of (8.59) (see (7.75)), calculate the
$k^2$-corrections to the long-wavelength solution (8.61). Verify that the “conserved”
quantity $\zeta \propto \theta^2(u/\theta)'$ (see also (7.72)) blows up during an oscillatory stage. Hence,
contrary to the claims often made in the literature, it cannot be used to trace the
evolution of perturbation through this stage.


Problem 8.8 Synchronous coordinate system. (a) Verify that a scalar field perturbation
in the synchronous coordinate system can be expressed through the gravitational
potential as

$$\delta\varphi_s = \overline{\delta\varphi} - \dot\varphi_0\int\Phi\,dt = -\dot\varphi_0\int\frac{c_s^2}{\dot H a^2}\,\Delta\Phi\,dt, \tag{8.74}$$

where $F_1$, the constant of integration, corresponds to the fictitious mode. The
relation above is exact. In the long-wavelength limit the physical mode of $\delta\varphi_s$ is of
order $k^2$. Considering a long-wavelength perturbation and using (8.70), show that
in an inflationary phase

$$\delta\varphi_s = F_1\dot\varphi_0 + \frac{1}{2}\,A\,\frac{\dot\varphi_0}{H}\left(\frac{kc_s}{Ha}\right)^2. \tag{8.75}$$

Skipping the fictitious mode, express $A$ in terms of $\delta\varphi_s$, $H$ and $\dot\varphi_0$ at the moment
of the sound horizon crossing. Compare the result obtained with the expression
previously derived for $A$ in terms of $\overline{\delta\varphi}$. (Hint To derive the second equality in
(8.74), use (8.52) to express $\overline{\delta\varphi}$ in terms of $\Phi$ and $\Delta\Phi$.)

(b) Substituting (8.67) into (7.29), verify that for a long-wavelength perturbation

$$\psi_s \simeq A + \tilde F_1 H, \qquad E_s \simeq A\int\frac{1}{a^3}\left(\int a\,dt\right)dt + \tilde F_1\int\frac{dt}{a^2} + \tilde F_2, \tag{8.76}$$

where $\tilde F_1$ and $\tilde F_2$ are constants of integration corresponding to fictitious modes.
Find the relation between $F_1$ and $\tilde F_1$, $\tilde F_2$. Write down the metric components $\delta g_{ik}$
in the synchronous coordinate system.


Starting with quantum fluctuations, the resulting amplitude of perturbations in
the post-inflationary epoch can be fixed if we know $\delta\varphi$ at horizon crossing. The
natural question arises: which $\delta\varphi$ plays the role of a canonical quantization variable?
We found in Problem 8.8 that one can get results differing by a numerical factor
depending on whether we relate the quantum perturbation to $\overline{\delta\varphi}$ or $\delta\varphi_s$. This is
not surprising because at the moment of horizon crossing the metric fluctuations
may become relevant and the Minkowski space approximation fails. To resolve the
gauge ambiguity and derive the exact numerical coefficients we need a rigorous
quantum theory.


8.3.3 Quantizing perturbations 


Action In order to construct a canonical quantization variable and properly normalize
the amplitude of quantum fluctuations, we need the action for the cosmological
perturbations. To obtain it one expands the action for the gravitational and scalar
fields to second order in perturbations. After use of the constraints, the result is
reduced to an expression containing only the physical degrees of freedom. The
steps are very cumbersome but fortunately they can be avoided. This is because
the action for the perturbations can be unambiguously inferred directly from the
equations of motion (8.57) up to an overall time-independent factor. This factor can
then be fixed by calculating the action in some simple limiting case. The first order
action reproducing the equations of motion (8.57) is

$$S = \int\left[\left(\frac{v}{z}\right)'\hat O\left(\frac{u}{\theta}\right) - \frac{1}{2}\,c_s^2(\Delta u)\,\hat O u + \frac{1}{2}\,c_s^2\,v\,\hat O v\right]d\eta\,d^3x, \tag{8.77}$$

where $\hat O \equiv \hat O(\Delta)$ is a time-independent operator to be determined. Using the first
equation in (8.57) to express $u$ in terms of $(v/z)'$, we obtain

$$S = \frac{1}{2}\int\left[z^2\left(\frac{v}{z}\right)'\frac{\hat O}{\Delta}\left(\frac{v}{z}\right)' + c_s^2\,v\,\hat O v\right]d\eta\,d^3x. \tag{8.78}$$


Problem 8.9 Write down the action for a massless scalar field in a flat de Sitter
universe. Comparing action (8.78) in the limiting case $\dot\varphi_0/H \to 0$ to the action for
a free scalar field in the de Sitter universe, verify that $\hat O = \Delta$.


With the result of the above problem, (8.78) becomes

$$S \equiv \int\mathcal{L}\,d\eta\,d^3x = \frac{1}{2}\int\left(v'^2 + c_s^2\,v\Delta v + \frac{z''}{z}\,v^2\right)d\eta\,d^3x, \tag{8.79}$$

after we drop the total derivative terms. Varying this action with respect to $v$ we
obtain

$$v'' - c_s^2\,\Delta v - \frac{z''}{z}\,v = 0. \tag{8.80}$$

Note that this equation also follows from the second equation in (8.57) after
substituting $u$ in terms of $v$.


Problem 8.10 The long-wavelength solution of (8.80) can be written in a similar
manner to (8.61):

$$v = C_1^{(v)}z + C_2^{(v)}z\int_{\eta_0}\frac{d\eta}{z^2} + O\left((k\eta)^2\right), \tag{8.81}$$

where $C_1^{(v)}$ and $C_2^{(v)}$ are constants of integration. Because $u$ and $v$ satisfy a system
of two first order differential equations, there are only two independent constants
of integration. Therefore $C_1^{(v)}$ and $C_2^{(v)}$ can be expressed in terms of the $C_1$ and $C_2$
in (8.61). Verify that $C_1^{(v)} = C_2$ and $C_2^{(v)} = -k^2C_1$.


Quantization The quantization of cosmological perturbations with action (8.79) is
thus formally equivalent to the quantization of a “free scalar field” $v$ with
time-dependent “mass” $m^2 = -z''/z$ in Minkowski space. The time dependence of the
“mass” is due to the interaction of the perturbations with the homogeneous expanding
background. The energy of the perturbations is not conserved and they can be
excited by borrowing energy from the Hubble expansion.

The canonical quantization variable

$$v = \sqrt{\varepsilon_{,X}}\;a\left(\overline{\delta\varphi} + \frac{\varphi_0'}{\mathcal{H}}\,\Psi\right) = \sqrt{\varepsilon_{,X}}\;a\left(\delta\varphi + \frac{\varphi_0'}{\mathcal{H}}\,\psi\right) \tag{8.82}$$

is a gauge-invariant combination of the scalar field and metric perturbations.


Problem 8.11 Considering only the physical (nonfictitious) mode of a long-wavelength
perturbation in the synchronous coordinate system, verify that the
second term in the second equality in (8.82) dominates over the first.


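The first step in quantizing (8.79) is to define the momentum $\pi$ canonically
conjugate to $v$,

$$\pi \equiv \frac{\partial\mathcal{L}}{\partial v'} = v'. \tag{8.83}$$

In quantum theory, the variables $v$ and $\pi$ become operators $\hat v$ and $\hat\pi$, which at any
moment of time $\eta$ satisfy the standard commutation relations:

$$[\hat v(\eta, \mathbf{x}), \hat v(\eta, \mathbf{y})] = [\hat\pi(\eta, \mathbf{x}), \hat\pi(\eta, \mathbf{y})] = 0,$$
$$[\hat v(\eta, \mathbf{x}), \hat\pi(\eta, \mathbf{y})] = [\hat v(\eta, \mathbf{x}), \hat v'(\eta, \mathbf{y})] = i\delta(\mathbf{x} - \mathbf{y}), \tag{8.84}$$

where we have set $\hbar = 1$. The operator $\hat v$ obeys the same equation as the
corresponding classical variable $v$,

$$\hat v'' - c_s^2\,\Delta\hat v - \frac{z''}{z}\,\hat v = 0, \tag{8.85}$$

and its general solution can be written as

$$\hat v(\eta, \mathbf{x}) = \frac{1}{\sqrt{2}}\int\left[v_k^*(\eta)\,e^{i\mathbf{kx}}\,\hat a_{\mathbf{k}}^- + v_k(\eta)\,e^{-i\mathbf{kx}}\,\hat a_{\mathbf{k}}^+\right]\frac{d^3k}{(2\pi)^{3/2}}, \tag{8.86}$$

where the temporal mode functions $v_k(\eta)$ satisfy

$$v_k'' + \omega_k^2(\eta)\,v_k = 0, \qquad \omega_k^2(\eta) \equiv c_s^2k^2 - z''/z. \tag{8.87}$$

We are free to impose the bosonic commutation relations for the creation and
annihilation operators on the conjugated operator-valued constants of integration
$\hat a_{\mathbf{k}}^-$ and $\hat a_{\mathbf{k}}^+$:

$$[\hat a_{\mathbf{k}}^-, \hat a_{\mathbf{k}'}^-] = [\hat a_{\mathbf{k}}^+, \hat a_{\mathbf{k}'}^+] = 0, \qquad [\hat a_{\mathbf{k}}^-, \hat a_{\mathbf{k}'}^+] = \delta(\mathbf{k} - \mathbf{k}'). \tag{8.88}$$

Substituting (8.86) into (8.84), we find that they are consistent with commutation
relations (8.84) only if the mode functions $v_k(\eta)$ obey the normalization condition

$$v_k'v_k^* - v_kv_k^{*\prime} = 2i. \tag{8.89}$$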


The expression on the left hand side is a Wronskian of (8.87) built from two independent
solutions $v_k$ and $v_k^*$; therefore it does not depend on time. It follows from
(8.89) that $v_k(\eta)$ is a complex solution of the second order differential equation
(8.87). To specify it fully and thus determine the physical meaning of the operators
$\hat a_{\mathbf{k}}^\pm$, we need the initial conditions for $v_k$ and $v_k'$ at some initial time $\eta = \eta_i$.
Substituting

$$v_k = r_k\exp(i\alpha_k)$$

into (8.89) we infer that the real functions $r_k$ and $\alpha_k$ obey the condition

$$r_k^2\,\alpha_k' = 1. \tag{8.90}$$

Next we note that (8.87) describes a harmonic oscillator with energy

$$E_k = \frac{1}{2}\left(|v_k'|^2 + \omega_k^2|v_k|^2\right) = \frac{1}{2}\left(r_k'^2 + r_k^2\alpha_k'^2 + \omega_k^2r_k^2\right) = \frac{1}{2}\left(r_k'^2 + \frac{1}{r_k^2} + \omega_k^2r_k^2\right). \tag{8.91}$$

We want to consider the minimal possible fluctuations allowed by the uncertainty
relations. The energy is minimized when $r_k'(\eta_i) = 0$ and $r_k(\eta_i) = \omega_k^{-1/2}$. We thus
obtain

$$v_k(\eta_i) = \frac{1}{\sqrt{\omega_k}}\,e^{i\alpha_k(\eta_i)}, \qquad v_k'(\eta_i) = i\sqrt{\omega_k}\,e^{i\alpha_k(\eta_i)}. \tag{8.92}$$

Although the phase factors $\alpha_k(\eta_i)$ remain undetermined, they are irrelevant and we
can set them to zero. Note that the above considerations are valid only if $\omega_k^2 > 0$,
that is, for modes with $c_s^2k^2 > (z''/z)_{\eta_i}$.
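The minimization behind (8.92) can be reproduced numerically. The sketch below is a minimal illustration with hypothetical function names: it sets $r_k' = 0$, uses the constraint (8.90) in the form already substituted into (8.91), and locates the minimum of $E_k(r_k)$ by a ternary search, which is adequate because the function has a single minimum for $r_k > 0$.

```python
# Sketch: minimum of E(r) = (1/r^2 + omega^2 r^2) / 2 from (8.91) with r' = 0.

def mode_energy(r, omega, r_prime=0.0):
    """Oscillator energy (8.91) after using r^2 * alpha' = 1."""
    return 0.5 * (r_prime ** 2 + 1.0 / r ** 2 + omega ** 2 * r ** 2)


def minimise_over_r(omega, lo=1e-3, hi=1e3, iterations=200):
    """Ternary search for the minimum of the unimodal function E(r), r > 0."""
    for _ in range(iterations):
        m1 = lo + (hi - lo) / 3.0
        m2 = hi - (hi - lo) / 3.0
        if mode_energy(m1, omega) < mode_energy(m2, omega):
            hi = m2
        else:
            lo = m1
    return 0.5 * (lo + hi)


omega = 4.0
r_min = minimise_over_r(omega)
e_min = mode_energy(r_min, omega)

print(f"r_min = {r_min:.6f}, omega^(-1/2) = {omega ** -0.5:.6f}")
print(f"E_min = {e_min:.6f}, omega = {omega:.6f}")
```

The minimum sits at $r_k = \omega_k^{-1/2}$ with $E_k = \omega_k$, which in units $\hbar = 1$ with the normalization (8.89) is the ground-state energy of the oscillator.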
The next step in quantization is to define the “vacuum” state $|0\rangle$ as the state
annihilated by operators $\hat a_{\mathbf{k}}^-$:

$$\hat a_{\mathbf{k}}^-|0\rangle = 0. \tag{8.93}$$

We further assume that a complete set of independent states in the corresponding
Hilbert space can be obtained by acting with the products of creation operators on the
vacuum state $|0\rangle$. If the $\omega_k$ do not depend on time, then the vector $|0\rangle$ corresponds
to the familiar Minkowski vacuum. Assuming $c_s$ changes adiabatically, we find
that modes with $c_s^2k^2 \gg (z''/z)$ remain unexcited and minimal fluctuations are well
defined. On the other hand, for modes with $c_s^2k^2 < (z''/z)$, we have $\omega_k^2(\eta_i) < 0$, and
the initial minimal fluctuations on corresponding scales cannot be unambiguously
determined. These scales exceed the Hubble scale at the beginning of inflation and
are subsequently stretched to huge unobservable scales; therefore the question of
initial fluctuations here is fortunately moot. The inhomogeneities responsible for
the observable structure originate from quantum fluctuations on scales where the
minimal fluctuations are unambiguously defined.


Spectrum Our final task is to calculate the correlation function, or equivalently, the
power spectrum of the gravitational potential. Taking into account (8.56), we have
the following expansion for the operator $\hat\Phi$:

$$\hat\Phi(\eta, \mathbf{x}) = \frac{4\pi(\varepsilon + p)^{1/2}}{\sqrt{2}}\int\left[u_k^*(\eta)\,e^{i\mathbf{kx}}\,\hat a_{\mathbf{k}}^- + u_k(\eta)\,e^{-i\mathbf{kx}}\,\hat a_{\mathbf{k}}^+\right]\frac{d^3k}{(2\pi)^{3/2}}, \tag{8.94}$$

where the mode functions $u_k(\eta)$ obey (8.59) and are related to the mode functions
$v_k(\eta)$ via (8.57). For the initial vacuum state $|0\rangle$ the correlation function at $\eta > \eta_i$
is

$$\langle 0|\hat\Phi(\eta, \mathbf{x})\,\hat\Phi(\eta, \mathbf{y})|0\rangle = \int 4(\varepsilon + p)\,|u_k|^2k^3\,\frac{\sin kr}{kr}\,\frac{dk}{k}, \tag{8.95}$$

where $r \equiv |\mathbf{x} - \mathbf{y}|$. According to the definition of the power spectrum in (8.8) and
(8.9), we have

$$\delta_\Phi^2(k, \eta) = 4(\varepsilon + p)\,|u_k(\eta)|^2k^3. \tag{8.96}$$

Given $v_k(\eta_i)$ and $v_k'(\eta_i)$, the initial conditions for $u_k$ can be inferred from (8.57).
Let us consider a short-wavelength perturbation with $c_s^2k^2 \gg (z''/z)$, for which
$\omega_k(\eta_i) \simeq c_sk$. In this case the initial conditions (8.92) can be rewritten in terms of
$u_k$ as

$$u_k(\eta_i) \simeq -\frac{i}{\sqrt{c_s}\,k^{3/2}}, \qquad u_k'(\eta_i) \simeq \frac{\sqrt{c_s}}{k^{1/2}}, \tag{8.97}$$

where we have neglected higher-order terms, which are suppressed by powers
of $(c_sk\eta_i)^{-1} \ll 1$. The corresponding short-wavelength WKB solution, valid for
$c_s^2k^2 \gg |\theta''/\theta|$, is

$$u_k(\eta) \simeq -\frac{i}{\sqrt{c_s}\,k^{3/2}}\,\exp\left(ik\int_{\eta_i}^{\eta}c_s\,d\tilde\eta\right). \tag{8.98}$$

During inflation the ratio $|\theta''/\theta|$ can be estimated roughly as $\eta^{-2}|\dot H/H^2|$. Because
$|\dot H/H^2| \ll 1$, (8.98) is still applicable within the short time interval

$$\frac{1}{c_sk} > |\eta| > \frac{1}{c_sk}\left|\frac{\dot H}{H^2}\right|^{1/2} \tag{8.99}$$

after the sound horizon crossing. At this time the argument in the exponent is almost
constant and $u_k$ freezes out. After a perturbation enters the long-wavelength regime
the time evolution of the gravitational potential is described by (8.67), and hence

$$u_k(\eta) = \frac{\Phi_k}{4\pi(\varepsilon + p)^{1/2}} = \frac{A_k}{4\pi(\varepsilon + p)^{1/2}}\left(1 - \frac{H}{a}\int a\,dt\right). \tag{8.100}$$

We can use (8.69) to simplify this expression during inflation:

$$u_k(\eta) \simeq A_k\,\frac{1}{4\pi(\varepsilon + p)^{1/2}}\,(H^{-1})^{\cdot} = A_k\,\frac{(\varepsilon + p)^{1/2}}{H^2}. \tag{8.101}$$

Taking into account that within the time interval (8.99) the ratio

$$\frac{(\varepsilon + p)^{1/2}}{H^2}$$

is almost constant and comparing (8.98) and (8.101), we obtain

$$A_k \simeq -\frac{i}{k^{3/2}}\left(\frac{H^2}{c_s^{1/2}(\varepsilon + p)^{1/2}}\right)_{c_sk \simeq Ha}. \tag{8.102}$$

Substituting (8.98) into (8.96) gives the scale-independent power spectrum

$$\delta_\Phi^2(k, t) \simeq \frac{4(\varepsilon + p)}{c_s} \tag{8.103}$$

for short-wavelength perturbations with $k > Ha(t)/c_s$. Using (8.100) with $A_k$ as
given in (8.102), we obtain

$$\delta_\Phi^2(k, t) \simeq \frac{16}{9}\left(\frac{\varepsilon}{c_s(1 + p/\varepsilon)}\right)_{c_sk \simeq Ha}\left(1 - \frac{H}{a}\int a\,dt\right)^2 \tag{8.104}$$

for long-wavelength perturbations with $Ha(t)/c_s > k > Ha_i/c_s$, where $a_i \equiv a(t_i)$.





Problem 8.12 Verify that for a massive scalar field of mass $m$ the spectrum
$\delta_\Phi(\lambda_{ph}, t)$ as a function of the physical scale $\lambda_{ph} \sim a(t)/k$ is given by

$$\delta_\Phi \simeq \frac{m}{\sqrt{3\pi}}\begin{cases} 1, & \lambda_{ph} < H^{-1}, \\[4pt] 1 + \dfrac{\ln(\lambda_{ph}H)}{\ln(a_f/a(t))}, & H^{-1} < \lambda_{ph} < H^{-1}\dfrac{a}{a_i}, \end{cases} \tag{8.105}$$

for $a_f > a(t) > a_i$, where $a_i$ and $a_f$ are the values of the scale factor at the beginning
and at the end of inflation respectively. The evolution of the spectrum (8.105) is
sketched in Figure 8.4.


It follows from (8.104) that in the post-inflationary, radiation-dominated epoch
the resulting power spectrum is

$$\delta_\Phi^2 \simeq \frac{64}{81}\left(\frac{\varepsilon}{c_s(1 + p/\varepsilon)}\right)_{c_sk \simeq Ha}. \tag{8.106}$$


This formula is applicable only on scales corresponding to
$(Ha/c_s)_i < k < (Ha/c_s)_f$. This range surely encompasses the observable universe. The
supercurvature perturbations are frozen during the radiation-dominated stage and
they survive unchanged until recombination. Only for those scales which re-enter
the horizon does the evolution proceed in a nontrivial way.

[Fig. 8.4: the spectrum (8.105) as a function of $\lambda_{ph}$ at successive values of the scale factor, with the plateau level $m/\sqrt{3\pi}$ marked.]


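Problem 8.13 Verify that for an inflationary model with potential $V = (\lambda/n)\varphi^n$,

$$\delta_\Phi^2 \simeq \frac{128}{27}\,\frac{n^{(n-4)/2}}{(4\pi)^{n/2}}\,\lambda\left[\ln(\lambda_{ph}/\lambda_\gamma)\right]^{(n+2)/2}, \tag{8.107}$$

where $\lambda_\gamma$ is the typical wavelength of the background radiation.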


Spectral tilt It follows from (8.106) that the amplitude of the metric perturbation
on a given comoving scale is determined by the energy density and by deviation
of the equation of state from the vacuum equation of state at the time of horizon
crossing. On galactic scales, $\delta_\Phi^2$ is of order $10^{-10}$ and $(1 + p/\varepsilon)$ can be estimated
as $\sim 10^{-2}$; therefore we conclude that $\varepsilon \sim 10^{-12}$ of the Planckian density at this
time. This is a rather robust and generic estimate for inflation during the last 70
e-folds. Only if $c_s \ll 1$, for instance in k inflation, can we avoid this conclusion.
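The estimate of the energy density is a direct inversion of (8.106). A minimal sketch, with illustrative function names and with $\varepsilon$ expressed in Planck units ($G = 1$), is:

```python
# Sketch: invert (8.106) to estimate the energy density at horizon crossing.

def scalar_power(eps, one_plus_w, c_s=1.0):
    """delta_Phi^2 from (8.106); eps in Planck units, one_plus_w = 1 + p/eps."""
    return (64.0 / 81.0) * eps / (c_s * one_plus_w)


def energy_density_from_power(delta_phi_sq, one_plus_w, c_s=1.0):
    """Energy density implied by a given delta_Phi^2, obtained by inverting (8.106)."""
    return (81.0 / 64.0) * delta_phi_sq * c_s * one_plus_w


eps_crossing = energy_density_from_power(1e-10, 1e-2)   # canonical field, c_s = 1
eps_k_inflation = energy_density_from_power(1e-10, 1e-2, c_s=0.01)

print(f"c_s = 1:    eps ~ {eps_crossing:.2e} (Planck units)")
print(f"c_s = 0.01: eps ~ {eps_k_inflation:.2e} (Planck units)")
```

The value $c_s = 0.01$ is an arbitrary illustration of how a small speed of sound lowers the inferred energy density.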
Since inflation must have a graceful exit, the energy density and the equation
of state slowly change during inflation. As a consequence the amplitude of the
perturbations generated depends slightly on the lengthscale. The energy density
always decreases and it is natural to expect that the deviation of the equation of
state from that for the vacuum should increase towards the end of inflation. It
follows then from (8.106) that the amplitude of those perturbations which crossed
the horizon earlier must be larger than the amplitude of perturbations which crossed
later. Within a narrow range of scales, one can always approximate the spectrum
by the power law, $\delta_\Phi^2(k) \propto k^{n_S - 1}$, and thus characterize it by the spectral index $n_S$.
A flat spectrum corresponds to $n_S = 1$.

The expression for the spectral index follows from (8.106):

$$n_S - 1 \equiv \frac{d\ln\delta_\Phi^2}{d\ln k} \simeq -3\left(1 + \frac{p}{\varepsilon}\right) - \frac{1}{H}\left(\ln\left(1 + \frac{p}{\varepsilon}\right)\right)^{\cdot} - \frac{(\ln c_s)^{\cdot}}{H}, \tag{8.108}$$

where the quantities on the right hand side must be calculated at the time of horizon
crossing. In deriving this formula we have taken into account that $d\ln k \simeq d\ln a_k$.
This relation follows from the condition determining horizon crossing, $c_sk \simeq Ha_k$,
if we neglect the change in $c_s$ and $H$. All terms on the right hand side of (8.108) are
negative for a generic inflationary scenario. Therefore, inflation does not predict
a flat spectrum, as is quite often mistakenly stated. Instead, it predicts a red-tilted
spectrum: $n_S < 1$, so that the amplitude grows slightly towards the larger scales. The
physical reason for this tilt is the necessity for a smooth graceful exit. To obtain
an estimate for the tilt we note that the galactic scales cross the horizon around
50-60 e-folds before the end of inflation. At this time $(1 + p/\varepsilon)$ is larger than $10^{-2}$.
The second term in (8.108) is about the same order of magnitude and the spectral
index can thus be estimated as $n_S \simeq 0.96$. The concrete value of $n_S$ depends on a
particular inflationary scenario. Even without knowing this scenario, however, one
could expect that $n_S < 0.97$. By inspection of the variety of scenarios, one infers
that it is rather difficult to get a very large deviation from the flat spectrum and that
it is likely $n_S > 0.92$.


Problem 8.14 Consider inflation in a model with potential $V$ and verify that

$$n_S - 1 \simeq -\frac{3}{8\pi}\left(\frac{V_{,\varphi}}{V}\right)^2 + \frac{1}{4\pi}\,\frac{V_{,\varphi\varphi}}{V}. \tag{8.109}$$

Check that for the power-law potential, $V \propto \varphi^n$,

$$n_S - 1 \simeq -\frac{n(n + 2)}{8\pi\varphi^2_{k \simeq Ha}} \simeq -\frac{n + 2}{2N}, \tag{8.110}$$

where $N$ is the number of e-folds before the end of inflation when the corresponding
perturbation crosses the horizon. In the case of a massive scalar field, $n = 2$, and
$n_S \simeq 0.96$ on galactic scales for which $N \simeq 50$. For the quartic potential $n = 4$ and
$n_S \simeq 0.94$. How much does the spectral index “run” when the scale changes by one
decade?
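The numbers quoted in Problem 8.14 can be reproduced with a short script. The sketch below uses illustrative function names and assumes the standard slow-roll relation $N \simeq 4\pi\varphi^2/n$ between the field value at horizon crossing and the number of e-folds (in units $G = 1$), which is what links the two forms in (8.110).

```python
import math


def spectral_index_from_potential(V, dV, d2V, phi):
    """n_S from (8.109) for a potential V with first and second derivatives dV, d2V."""
    ratio = dV(phi) / V(phi)
    return 1.0 - 3.0 / (8.0 * math.pi) * ratio ** 2 + d2V(phi) / (4.0 * math.pi * V(phi))


def spectral_index_power_law(n, N):
    """n_S from the second form of (8.110) for V ∝ phi^n."""
    return 1.0 - (n + 2.0) / (2.0 * N)


def field_at_crossing(n, N):
    """Field value N e-folds before the end, assuming N = 4 pi phi^2 / n."""
    return math.sqrt(n * N / (4.0 * math.pi))


def power_law_check(n, N):
    """Evaluate (8.109) for V = phi^n at the field value corresponding to N."""
    phi = field_at_crossing(n, N)
    return spectral_index_from_potential(
        lambda x: x ** n,
        lambda x: n * x ** (n - 1),
        lambda x: n * (n - 1) * x ** (n - 2),
        phi,
    )


ns_quadratic = spectral_index_power_law(2, 50)
ns_quartic = spectral_index_power_law(4, 50)

print(f"n = 2, N = 50: n_S = {ns_quadratic:.3f} (general formula: {power_law_check(2, 50):.3f})")
print(f"n = 4, N = 50: n_S = {ns_quartic:.3f} (general formula: {power_law_check(4, 50):.3f})")
```

Both routes give the same values, which confirms that (8.109) reduces to (8.110) under the stated slow-roll assumption.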


How do quantum fluctuations become classical? When we look at the sky we see
the galaxies in certain positions. If these galaxies originated from initial quantum
fluctuations, a natural question arises: how does a galaxy, e.g. Andromeda, find itself
at a particular place if the initial vacuum state was translational-invariant with
no preferred position in space? Quantum mechanical unitary evolution does not destroy
translational invariance and hence the answer to this question must lie in the
transition from quantum fluctuations to classical inhomogeneities. Decoherence is
a necessary condition for the emergence of classical inhomogeneities and can easily
be justified for amplified cosmological perturbations. However, decoherence is not
sufficient to explain the breaking of translational invariance. It can be shown that
as a result of unitary evolution we obtain a state which is a superposition of many
macroscopically different states, each corresponding to a particular realization of
galaxy distribution. Most of these realizations have the same statistical properties.
Such a state is a close cosmic analog of the “Schrödinger cat.” Therefore, to pick
an observed macroscopic state from the superposition we have to appeal either to
Bohr’s reduction postulate or to Everett’s many-worlds interpretation of quantum
mechanics. The first possibility does not look convincing in the cosmological context.
The reader who would like to pursue this issue can consult the corresponding
references in “Bibliography” (Everett, 1957; De Witt and Graham, 1973).


8.4 Gravitational waves from inflation 


Quantizing gravitational waves In a similar manner to scalar perturbations,
long-wavelength gravitational waves are also generated in inflation. The calculations are
not very different from those presented in the previous section. First of all we need
the action for the gravitational waves. This action can be derived by expanding the
Einstein action up to the second order in transverse, traceless metric perturbations
$h_{ik}$. The result is

$$S = \frac{1}{64\pi}\int a^2\left(h^i_j{}'\,h^j_i{}' - h^i_{j,l}\,h^{j,l}_i\right)d\eta\,d^3x, \tag{8.111}$$

where the spatial indices are raised and lowered with the help of the unit tensor $\delta_{ik}$.


Problem 8.15 Derive (8.111). (Hint Calculate the curvature tensor $R$ considering
small perturbations around Minkowski space and then use (5.111) to make the
appropriate conformal transformation to an expanding universe.)


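Substituting the expansion

$$h^i_j(\mathbf{x}, \eta) = \int h_{\mathbf{k}}(\eta)\,e^i_j(\mathbf{k})\,e^{i\mathbf{kx}}\,\frac{d^3k}{(2\pi)^{3/2}}, \tag{8.112}$$

where $e^i_j(\mathbf{k})$ is the polarization tensor, into (8.111), we obtain

$$S = \frac{1}{64\pi}\int a^2e^i_je^j_i\left(h_{\mathbf{k}}'h_{-\mathbf{k}}' - k^2h_{\mathbf{k}}h_{-\mathbf{k}}\right)d\eta\,d^3k. \tag{8.113}$$

Rewritten in terms of the new variable

$$v_{\mathbf{k}} = \sqrt{\frac{e^i_je^j_i}{32\pi}}\;a\,h_{\mathbf{k}}, \tag{8.114}$$

the action becomes

$$S = \frac{1}{2}\int\left(v_{\mathbf{k}}'v_{-\mathbf{k}}' - \left(k^2 - \frac{a''}{a}\right)v_{\mathbf{k}}v_{-\mathbf{k}}\right)d\eta\,d^3k. \tag{8.115}$$

It describes a real scalar field in terms of its Fourier components. The resulting
equations of motion are

$$v_k'' + \omega_k^2(\eta)\,v_k = 0, \qquad \omega_k^2(\eta) \equiv k^2 - a''/a. \tag{8.116}$$

There is no need to repeat the quantization procedure for this case. Taking into
account (8.114) and (8.112), we immediately find the correlation function

$$\langle 0|\hat h^i_j(\eta, \mathbf{x})\,\hat h^j_i(\eta, \mathbf{y})|0\rangle = \frac{8}{\pi a^2}\int|v_k|^2k^3\,\frac{\sin kr}{kr}\,\frac{dk}{k}, \tag{8.117}$$

where $v_k$ is the solution of (8.116) with initial conditions

$$v_k(\eta_i) = \frac{1}{\sqrt{\omega_k}}, \qquad v_k'(\eta_i) = i\sqrt{\omega_k}. \tag{8.118}$$

These initial conditions make sense only if $\omega_k > 0$, that is, for gravitational waves
with $k^2 > (a''/a)_{\eta_i}$. The power spectrum, characterizing the strength of a gravitational
wave with comoving wavenumber $k$, is correspondingly

$$\delta_h^2(k, \eta) = \frac{8|v_k|^2k^3}{\pi a^2}. \tag{8.119}$$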

Inflation In contrast to scalar perturbations, the deviation of the equation of state 
from the vacuum equation of state is not so crucial to the evolution of gravitational 
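waves. Therefore, we first consider a pure de Sitter universe where $a = -(H_\Lambda \eta)^{-1}$.
In this case (8.116) simplifies to

$$ v''_k + \left( k^2 - \frac{2}{\eta^2} \right) v_k = 0 \tag{8.120} $$

and has the exact solution

$$ v_k(\eta) = \frac{1}{k\eta} \left\{ C_1 \left[ k\eta \cos(k\eta) - \sin(k\eta) \right] + C_2 \left[ k\eta \sin(k\eta) + \cos(k\eta) \right] \right\}. \tag{8.121} $$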


Let us consider gravitational waves with $k |\eta_i| \gg 1$ for which $\omega_k \simeq k$. Taking
into account the initial conditions in (8.118), we can determine the constants of
integration $C_1$ and $C_2$ and the solution becomes

$$ v_k(\eta) = \frac{1}{\sqrt{k}} \left( 1 + \frac{i}{k\eta} \right) \exp\left( ik(\eta - \eta_i) \right). \tag{8.122} $$

Substituting this into (8.119), we obtain

$$ \delta_h^2 = \frac{8 H_\Lambda^2}{\pi} \left[ 1 + (k\eta)^2 \right] = \frac{8 H_\Lambda^2}{\pi} \left[ 1 + \left( \frac{k_{ph}}{H_\Lambda} \right)^2 \right], \tag{8.123} $$


where $k_{ph} \equiv k/a$ is the physical wavenumber. This formula is applicable only
for $k_{ph} \gg H_\Lambda (\eta/\eta_i)$. The amplitude $\delta_h$ as a function of the physical
wavelength $\lambda_{ph} \sim k_{ph}^{-1}$ is sketched in Figure 8.5. Long-wavelength gravitational waves
with $H_\Lambda^{-1} (\eta_i/\eta) > \lambda_{ph} > H_\Lambda^{-1}$ have a flat spectrum with amplitude proportional to $H_\Lambda$.
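
A quick numerical check can confirm that (8.122) solves (8.120) and reproduces (8.123).
The Python sketch below is illustrative only: the function names, the finite-difference
step and the sample values of $k$, $\eta$, $\eta_i$ and $H_\Lambda$ are assumptions chosen for the
example and are not taken from the text.

```python
import cmath
import math


def mode_function(k, eta, eta_i=-50.0):
    """Mode function (8.122) in de Sitter space; eta and eta_i are negative conformal times."""
    return (1 + 1j / (k * eta)) * cmath.exp(1j * k * (eta - eta_i)) / math.sqrt(k)


def ode_residual(k, eta, eta_i=-50.0, step=1.0e-3):
    """Central-difference estimate of v'' + (k^2 - 2/eta^2) v, which (8.120) sets to zero."""
    v_minus = mode_function(k, eta - step, eta_i)
    v_mid = mode_function(k, eta, eta_i)
    v_plus = mode_function(k, eta + step, eta_i)
    second_derivative = (v_plus - 2 * v_mid + v_minus) / step**2
    return second_derivative + (k**2 - 2 / eta**2) * v_mid


def tensor_power(k, eta, hubble, eta_i=-50.0):
    """Power spectrum (8.119) with the de Sitter scale factor a = -1/(H eta)."""
    a = -1.0 / (hubble * eta)
    v = mode_function(k, eta, eta_i)
    return 8 * abs(v) ** 2 * k**3 / (math.pi * a**2)


def tensor_power_closed_form(k, eta, hubble):
    """Closed-form result (8.123)."""
    return 8 * hubble**2 / math.pi * (1 + (k * eta) ** 2)


if __name__ == "__main__":
    k, eta, hubble = 2.0, -1.5, 1.0e-5  # illustrative values only
    print("ODE residual:", abs(ode_residual(k, eta)))
    print("numeric vs closed form:", tensor_power(k, eta, hubble),
          tensor_power_closed_form(k, eta, hubble))
```

For $k|\eta| \ll 1$ the bracket in (8.123) tends to 1, so the sketch returns a value close to
$8H_\Lambda^2/\pi$, which is the flat part of the spectrum described above.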

The above consideration refers to a pure de Sitter universe where H, is exactly 
constant. In realistic inflationary models the Hubble constant slowly changes with 
time. Recalling that the nondecaying mode of a gravitational wave is frozen on 
supercurvature scales (see Section 7.3.2), we obtain 

$$ \delta_h^2 \simeq \frac{8 H^2_{k \simeq Ha}}{\pi} = \frac{64}{3} \, \varepsilon_{k \simeq Ha}. \tag{8.124} $$

The tensor spectral index is then equal to

$$ n_T \equiv \frac{d \ln \delta_h^2}{d \ln k} \simeq -3 \left( 1 + \frac{p}{\varepsilon} \right)_{k \simeq Ha}, \tag{8.125} $$

and hence the spectrum of the gravitational waves is also slightly tilted to the
red. (Note that the tensor and scalar spectral indices are defined differently — see 
(8.108).) The ratio of tensor to scalar power spectrum amplitudes on supercurvature 
scales during the post-inflationary, radiation-dominated epoch is 


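$$ \frac{\delta_h^2}{\delta_\Phi^2} \simeq 27 \left[ c_s \left( 1 + \frac{p}{\varepsilon} \right) \right]_{k \simeq Ha}. \tag{8.126} $$

For a canonical scalar field ($c_s = 1$), this ratio is between 0.2 and 0.3. However, in
k inflation, where $c_s \ll 1$, it can be strongly suppressed. Thus, at least in principle,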
k inflation is phenomenologically distinguishable from inflation based on a scalar 
field potential. 
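
As a quick consistency check, (8.125) and (8.126) can be combined to eliminate the factor
$(1 + p/\varepsilon)$. The relation below is only a rearrangement of those two formulas,
evaluated at the same horizon-crossing moment $k \simeq Ha$; no additional input is assumed.

```latex
\frac{\delta_h^2}{\delta_\Phi^2}
  \simeq 27\, c_s \left(1 + \frac{p}{\varepsilon}\right)
  = 27\, c_s \left(-\frac{n_T}{3}\right)
  = -9\, c_s\, n_T .
```

Read this way, a ratio of 0.2 to 0.3 with $c_s = 1$ corresponds to a tensor tilt $n_T$ of
roughly $-0.02$ to $-0.03$.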


Post-inflationary epoch We found in Section 7.3.2 that the amplitude of a grav- 
itational wave stays constant on supercurvature scales irrespective of changes in 
the equation of state. When the gravitational wave re-enters the horizon, however, 
its amplitude begins to decay in inverse proportion to the scale factor. Hence the 
spectrum remains unchanged only on large scales and is altered within the Hubble 
horizon. Neglecting the dark energy component, we can express the Hubble constant
at earlier times in terms of its present value $H_0$ as

$$ H(a) \simeq H_0 \begin{cases} (a_0/a)^{3/2}, & z < z_{eq}, \\ z_{eq}^{-1/2} (a_0/a)^{2}, & z > z_{eq}, \end{cases} \tag{8.127} $$

where $z_{eq}$ is the redshift at matter-radiation equality. For a gravitational wave with
a comoving wavenumber $k$, the value of the scale factor at horizon crossing, $a_k$,
is determined from the condition $k \simeq H(a_k) a_k$. After that the amplitude decreases
by a factor $a_k/a_0$ and we obtain the following spectrum at the present time:

$$ \delta_h \simeq H_\Lambda \begin{cases} z_{eq}^{-1/2} (\lambda_{ph} H_0), & \lambda_{ph} < H_0^{-1} z_{eq}^{-1/2}, \\ (\lambda_{ph} H_0)^2, & H_0^{-1} > \lambda_{ph} > H_0^{-1} z_{eq}^{-1/2}, \\ 1, & \lambda_{ph} > H_0^{-1}, \end{cases} \tag{8.128} $$

where $H_\Lambda$ is the value of the Hubble constant during inflation and $\lambda_{ph} \sim a_0/k$
is the physical wavelength. This spectrum is sketched in Figure 8.6. On scales of
several light years the typical amplitude of the primordial gravitational waves can
be estimated as roughly $10^{-17}$ for a realistic model of inflation. This amplitude
drops linearly towards smaller scales and so the prospects of direct detection of the 
primordial gravitational background are not very promising. However, as we shall 
see in the next chapter, these gravitational waves influence the CMB temperature 
fluctuations and therefore may be detectable indirectly. 
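
The piecewise spectrum (8.128) is easy to tabulate. The Python sketch below is a minimal
illustration; the function name and the sample values ($H_\Lambda \sim 10^{-5}$ in Planck
units, $z_{eq} \sim 10^4$, $H_0^{-1} \sim 1.4 \times 10^{10}$ light years) are order-of-magnitude
assumptions for the example rather than values quoted in this section, and numerical
coefficients of order unity are ignored throughout.

```python
import math


def present_day_amplitude(wavelength_times_h0, h_inflation, z_eq):
    """Piecewise amplitude (8.128); the first argument is lambda_ph * H_0 (dimensionless)."""
    x = wavelength_times_h0
    if x >= 1.0:
        # Still outside the horizon today: amplitude frozen at its inflationary value.
        return h_inflation
    if x >= z_eq ** -0.5:
        # Entered the horizon during matter domination.
        return h_inflation * x**2
    # Entered the horizon during radiation domination.
    return h_inflation * z_eq ** -0.5 * x


if __name__ == "__main__":
    hubble_radius_ly = 1.4e10   # assumed present Hubble radius in light years
    h_inflation = 1.0e-5        # assumed inflationary Hubble scale, Planck units
    z_eq = 1.0e4                # assumed redshift of equality
    for wavelength_ly in (3.0, 3.0e6, 3.0e9):
        x = wavelength_ly / hubble_radius_ly
        print(wavelength_ly, present_day_amplitude(x, h_inflation, z_eq))
```

The two branches agree at $\lambda_{ph} H_0 = z_{eq}^{-1/2}$, so the sketched spectrum is continuous;
the estimate on light-year scales scales linearly with the assumed $H_\Lambda$.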


8.5 Self-reproduction of the universe


The amplitude of scalar perturbations takes its maximum value on scales corresponding
to $k \sim H_i a_i$, that is, those which left the horizon at the very beginning of
inflation. For a massive scalar field at the end of inflation, this maximal amplitude
can be estimated from (8.37) and is equal to

$$ \delta_\Phi^{max} \sim m \ln(a_f/a_i) \sim m \varphi_i^2. \tag{8.129} $$

If the initial value $\varphi_i$ is larger than $m^{-1/2}$, then inhomogeneities on scales
$\lambda_{ph} \sim H^{-1} a_f/a_i$ become very large ($\delta_\Phi > 1$) before inflation ends. Therefore, for large
initial values of the scalar field, the initial homogeneity is completely spoiled by
amplified quantum fluctuations on scales exceeding $\lambda_{ph}(t_f) \sim H^{-1} \exp(m^{-1})$. For
realistic values of $m$ these scales are enormous. For example, if $m \sim 10^{-6}$, they
are larger than $H^{-1} \exp(10^6)$ and exceed the observable scales, $\sim H^{-1} \exp(70)$,
by many orders of magnitude. On scales smaller than $H^{-1} \exp(m^{-1})$ the universe
remains quasi-homogeneous. Thus, if inflation begins at $m^{-1} > \varphi_i > m^{-1/2}$, then,
on one hand, it produces a very homogeneous, isotropic piece of space which is
large enough to encompass the observed universe while, on the other hand, quantum
fluctuations induce a large inhomogeneity on scales much larger than the observable
scale.

Furthermore, if $\varphi_i > m^{-1/2}$ in one causally connected region, then inflation never
ends but continues eternally somewhere in space. To see why this happens let us
consider a causal domain of size $H^{-1}$. In a typical Hubble time, $\Delta t_H \sim H^{-1}$,
the size of this domain grows to $H^{-1} \exp(H \Delta t_H) \sim e H^{-1}$ and hence gives rise to
$e^3 \simeq 20$ new domains of size $H^{-1}$. Now consider the averaged value of the scalar
field in each of these new domains. During a Hubble time the classical scalar field 
decreases by an amount 


$$ \Delta\varphi_{cl} \simeq -\frac{V_{,\varphi}}{3H} \Delta t_H \sim -\varphi^{-1}. \tag{8.130} $$

Simultaneously, quantum fluctuations stretched from sub-Hubble scales begin to
contribute to the mean value of the scalar field in each domain of size $H^{-1}$. Quantum
fluctuations with wavelength of order $H^{-1}$ and amplitude $\Delta\varphi_q \sim H \sim m\varphi$ are
superimposed on the classical field; in half of the regions they decrease the value
of the scalar field still further (Figure 8.7), while in the other half they increase the
field value. The overall change of $\varphi$ in these latter domains is about

$$ \Delta\varphi_{tot} = \Delta\varphi_{cl} + \Delta\varphi_q \sim -\varphi^{-1} + m\varphi. \tag{8.131} $$


It is clear that if $\varphi > m^{-1/2}$, the field grows and inflation always produces regions
where the scalar field exceeds its “initial” value. In Figure 8.8 we sketch the typical
trajectories describing the evolution of the scalar field within a typical Hubble
domain. For $\varphi \ll m^{-1/2}$, quantum fluctuations only slightly disturb the classical
slow-roll trajectory, while for $\varphi \gg m^{-1/2}$ they dominate and induce a “random
walk.” Because each domain of size $H^{-1}$ in turn produces other domains at an
exponential rate, the physical volume of space where the scalar field is larger than its
initial value grows exponentially. Thus, inflation continues forever and the universe
is said to be “self-reproducing.” In those regions where the field drops below the
self-reproduction scale, $\varphi_{rep} \sim m^{-1/2}$, quantum fluctuations are no longer relevant.
The size of these regions grows exponentially and eventually they produce very
large homogeneous domains where the universe is hot.
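
The competition between the classical drift (8.130) and the quantum kick in (8.131) can be
made concrete with a few lines of Python. This is only an order-of-magnitude sketch in
Planck units: the function names and the sample mass $m = 10^{-6}$ are assumptions made
for illustration, and all numerical coefficients of order unity are dropped, as in the
estimates above.

```python
import math


def field_steps_per_hubble_time(phi, mass):
    """Return (classical step, quantum step) from (8.130) and (8.131), up to O(1) factors."""
    classical = -1.0 / phi      # slow-roll drift towards smaller field values
    quantum = mass * phi        # typical fluctuation amplitude ~ H ~ m * phi
    return classical, quantum


def self_reproduction_scale(mass):
    """Field value where the two steps balance: phi_rep ~ m^(-1/2)."""
    return mass ** -0.5


def new_domains_per_hubble_time():
    """A domain of size 1/H stretches by a factor e in each direction: e^3 domains."""
    return math.e ** 3


if __name__ == "__main__":
    mass = 1.0e-6  # assumed illustrative value
    print("phi_rep ~", self_reproduction_scale(mass))
    for phi in (1.0e2, 1.0e3, 1.0e4):
        classical, quantum = field_steps_per_hubble_time(phi, mass)
        print(phi, classical + quantum)
    print("new domains per Hubble time ~", new_domains_per_hubble_time())
```

Above the balance point the net step in the "upward" half of the domains is positive, which
is the condition for self-reproduction stated in the text.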


Problem 8.16 Determine the self-reproduction scale for the power-law potential
$V = (\lambda/n) \varphi^n$.


To conclude, we have found that inflation generically leads to a self-reproducing 
universe and induces complicated global structure on very large scales. This global 
structure may become relevant for an observer but only in many many billions of 
years. However, for a better understanding of the initial condition problem, the self- 
reproduction regime is very important. There is no complete and reliable description 
of the self-reproducing universe at present and much more work has to be done to 
clarify the question of the global structure of the universe. 


8.6 Inflation as a theory with predictive power 


Assuming a stage of cosmic acceleration — inflation — we are able to make ro- 
bust predictions even in the absence of the actual inflationary scenario. The most 
important among them are: 


(i) the flatness of the universe; 
(ii) Gaussian scalar metric perturbations with a slightly red-tilted spectrum; 
(iii) long-wavelength gravitational waves.

The condition of flatness is not as “natural” as it might appear at first glance. We
recall that $\Omega_0 = 1$ was strongly disfavored by observations not so long ago. If gravity
were always an attractive force, it is absolutely unclear why the current value of $\Omega_0$
could not be, for instance, 0.01 or 0.2. Only inflation gives a natural justification for
$\Omega_0 = 1$. The deuterium abundance clearly indicates that baryons cannot contribute
more than a small percentage of the critical energy density. Therefore, inflation 
also predicts the existence of a dark component. It can be dark matter, dark energy 
or a combination of the two. In the absence of the actual inflationary scenario, 
we cannot make any prediction about the composition of the dark component. In 
spite of the tremendous progress made, we are still far from understanding the true 
nature of dark matter and dark energy. The current data on CMB fluctuations favor 
the critical density and, combined with the results from high-redshift supernovae, 
make it almost impossible to doubt the existence of dark matter and dark energy. 

The predicted spectrum for the scalar perturbations is also in good agreement 
with the current data. However, the accuracy of the observations is not yet sufficient 
to determine a small spectral tilt. The deviation of the spectrum from flat is an 
inevitable consequence of simple inflation and therefore it is extremely important 
to detect it. The amplitude of the power spectrum is a free parameter of the theory. 

The production of a significant number of long-wavelength gravitational waves 
is another generic prediction of a broad class of simple inflationary scenarios. While 
their detection would strongly support inflation, the absence of gravitational waves 
would not allow us to exclude simple inflation since their production can be avoided 
in k inflation. 

Since we do not know which concrete scenario was realized in nature, the ques- 
tion of the robustness of the predictions of inflation is of particular importance. 
Simple inflation does not leave much room for ambiguities. However, it is clear 
that by introducing extra parameters and by fine-tuning, one can alter the robust pre- 
dictions of the simple inflationary models. For example, by designing specifically 
fine-tuned potentials one can avoid the flatness constraint. Similarly, by involving 
many scalar fields, or by studying models with several different stages of inflation, 
one can obtain practically any spectrum of cosmological perturbations and induce 
nongaussianity. In our view, an increase in the complexity of the models simultaneously
increases the “price-to-performance” ratio; the theory gradually loses
its predictive power and becomes less attractive. Only observations confirming the 
robust predictions of inflation can completely assure us that we are on the right 
track in understanding our universe. 


9 


Cosmic microwave background anisotropies 


After recombination, the primordial radiation freely streams through the universe 
without any further scattering. An observer today detects the photons that last 
interacted with matter at redshift z ~ 1000, far beyond the stars and galaxies. The 
pattern of the angular temperature fluctuations gives us a direct snapshot of the 
distribution of radiation and energy at the moment of recombination, which is 
representative of what the universe looked like when it was a thousand times smaller 
and a hundred thousand times younger than today. 

The first striking feature is that the variations in intensity across the sky are tiny, 
less than 0.01% on average. We can conclude from this that the universe was ex- 
tremely homogeneous at that time, in contrast to the lumpy, highly inhomogeneous 
distribution of matter seen today. The second striking feature is that the average 
amplitude of the inhomogeneities is just what is required in a universe composed 
of cold dark matter and ordinary matter to explain the formation of galaxies and 
large-scale structure. Moreover, the temperature autocorrelation function indicates 
that the inhomogeneities have statistical properties in perfect accordance with what 
is predicted by inflationary models of the universe. 

In a map showing the microwave background temperature across the sky, the 
features subtending a given angle are associated with physics on a spatial scale 
that can be computed from the angle and the angular diameter distance to the last 
scattering surface. The latter depends on the cosmological model. The angular scale 
$\theta \sim 1^\circ$ corresponds to the Hubble radius at recombination, which is the dividing
line between the large-scale inhomogeneities that have not changed much since 
inflation and the small-scale perturbations that have entered the horizon before re- 
combination and been substantially modified by gravitational instability. Hence, 
observations of temperature fluctuations on large angular scales give us direct in- 
formation about the primordial spectrum of perturbations, and observations of the 
small-scale fluctuations enable us to determine the values of the cosmological
parameters that control the change of perturbation amplitudes after they enter the
Hubble scale. 

The purpose of this chapter is to derive the spectrum of microwave background 
fluctuations, assuming a nearly scale-invariant spectrum of primordial inhomo- 
geneities, as occurs in inflationary models. Today, sophisticated computer programs 
are used to obtain numerically precise predictions. Here, though, our purpose is to 
understand from first principles the physics behind the characteristic features of the 
spectrum and to determine how they depend on fundamental parameters. We are 
willing to sacrifice a little accuracy to obtain a solid, analytic insight. 

We first use an approximation of instantaneous recombination in which the 
radiation behaves as a perfect fluid before recombination and as an ensemble of 
the free photons immediately afterwards. This approximation is very good for 
large angular fluctuations arising from inhomogeneities on scales larger than the 
Hubble radius at recombination. In fact, recombination is a more gradual process 
that extends over a finite range of redshifts and this substantially influences the 
temperature fluctuations on small angular scales. The computations are then more 
complicated, but the problem is still treatable analytically. 

Throughout this chapter we consider a spatially flat universe predicted by in- 
flation and favored by the current observations. The modifications of the most 
important features of the CMB spectrum induced by the spatial curvature are rather 
obvious and will be briefly discussed. 


9.1 Basics 


Before recombination, radiation is strongly coupled to ordinary matter and it is 
well approximated as a perfect fluid. When sufficient neutral hydrogen has been 
formed, the photons cease interacting with the matter and, therefore, they must be 
described by a kinetic equation. 


Phase volume and Liouville’s theorem The state of a single photon with a given
polarization at (conformal) time $\eta$ can be completely characterized by its position
in the space $x^i(\eta)$ and its 3-momentum $p_i(\eta)$, where $i = 1, 2, 3$ is the spatial index.
Since the 4-momentum $p_\alpha$ satisfies the equation $g^{\alpha\beta} p_\alpha p_\beta = 0$, the “energy” $p_0$ can
be expressed in terms of the metric $g_{\alpha\beta}$ and $p_i$.

The one-particle phase volume element is a product of the differentials of spatial
coordinates and covariant components of the momentum:

$$ d^3x \, d^3p \equiv dx^1 dx^2 dx^3 \, dp_1 dp_2 dp_3. \tag{9.1} $$




It is invariant under general coordinate transformations. To prove this, let us go to 
another coordinate system 


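$$ \tilde\eta = \tilde\eta(\eta, x^i), \qquad \tilde x^i = \tilde x^i(\eta, x^j). $$

The phase volume $d^3\tilde x \, d^3\tilde p$ calculated at the hypersurface $\tilde\eta = const$ in this new
coordinate system is related to (9.1) by

$$ d^3\tilde x \, d^3\tilde p = J \, d^3x \, d^3p, \tag{9.2} $$

where

$$ J \equiv \frac{\partial(\tilde x^1, \tilde x^2, \tilde x^3, \tilde p_1, \tilde p_2, \tilde p_3)}{\partial(x^1, x^2, x^3, p_1, p_2, p_3)} \tag{9.3} $$

is the Jacobian of the transformation

$$ x^i \to \tilde x^i = \tilde x^i(x^j, \tilde\eta), \qquad p_i \to \tilde p_i = \left( \frac{\partial x^\alpha}{\partial \tilde x^i} \right) p_\alpha. \tag{9.4} $$

Note that the new coordinates $\tilde x^i$ should be considered here as functions of the old
coordinates $x^j$ and the new time $\tilde\eta$. Since $(\partial\tilde x/\partial p) = 0$, we have

$$ J = \det\left( \frac{\partial \tilde x^i}{\partial x^j} \right) \det\left( \frac{\partial \tilde p_k}{\partial p_m} \right) = \det\left( \frac{\partial \tilde x^i}{\partial x^j} \frac{\partial x^j}{\partial \tilde x^k} \right)_{\tilde\eta = const} = \det(\delta^i_k) = 1 \tag{9.5} $$

and therefore the phase volume is invariant.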


Problem 9.1 Verify that $dx^1 dx^2 dx^3 \, dp^1 dp^2 dp^3$ is not invariant under coordinate
transformations.
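
The invariance can be illustrated in the simplest special case of a constant,
time-independent linear map $\tilde x^i = A^i{}_j x^j$, for which the Jacobian is block diagonal.
The Python sketch below is a toy example under that assumption (the matrix `A` and the
helper names are invented for illustration); it is not the general proof and does not
replace the exercise above.

```python
def det3(m):
    """Determinant of a 3x3 matrix given as nested lists."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))


def inverse3(m):
    """Inverse of a 3x3 matrix via the adjugate (transpose of the cofactor matrix)."""
    d = det3(m)
    cof = [[0.0] * 3 for _ in range(3)]
    for i in range(3):
        for j in range(3):
            rows = [r for r in range(3) if r != i]
            cols = [c for c in range(3) if c != j]
            minor = (m[rows[0]][cols[0]] * m[rows[1]][cols[1]]
                     - m[rows[0]][cols[1]] * m[rows[1]][cols[0]])
            cof[i][j] = (-1) ** (i + j) * minor
    return [[cof[j][i] / d for j in range(3)] for i in range(3)]


def phase_volume_jacobians(a):
    """Return (J for covariant momenta p_i, J for contravariant momenta p^i)."""
    a_inv = inverse3(a)
    # Covariant components transform with the inverse matrix: p~_i = (A^-1)^j_i p_j.
    j_covariant = det3(a) * det3(a_inv)
    # Contravariant components transform like the coordinates: p~^i = A^i_j p^j.
    j_contravariant = det3(a) * det3(a)
    return j_covariant, j_contravariant


if __name__ == "__main__":
    A = [[2.0, 1.0, 0.0], [0.0, 1.0, 3.0], [1.0, 0.0, 1.0]]  # assumed example matrix
    print(phase_volume_jacobians(A))
```

For this map the covariant phase volume has unit Jacobian, while the contravariant one
picks up a factor $(\det A)^2$.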


Liouville’s theorem, which can easily be proved in flat spacetime, states that 
the phase volume of the Hamiltonian system is invariant under canonical trans- 
formations, or in other words, it is conserved along the trajectory of the particle. 
Considering an infinitesimal volume element, one can always go to a local iner- 
tial coordinate system (the Einstein elevator) in the vicinity of any point along the 
particle trajectory where the same theorem will obviously be valid. Yet, since the 
phase volume (9.1) is independent of the particular coordinate system, it must also 
be conserved as we move along a trajectory in curved spacetime. Hence, Liouville’s 
theorem must continue to hold in General Relativity. 


The Boltzmann equation Let us consider an ensemble of noninteracting identical
particles. If $dN$ is the number of particles per volume element $d^3x \, d^3p$, then the
distribution function $f$, characterizing the number density in one-particle phase
space, is defined by

$$ dN = f(x^i, p_j, t) \, d^3x \, d^3p. \tag{9.6} $$

Since phase volume is invariant under coordinate transformations, $f$ is a spacetime
scalar. In the absence of particle interactions (scatterings), the particle number
within the conserved phase volume does not change. As a result, the distribution
function obeys the collisionless Boltzmann equation

$$ \frac{Df(x^i(\eta), p_i(\eta), \eta)}{D\eta} \equiv \frac{\partial f}{\partial\eta} + \frac{dx^i}{d\eta} \frac{\partial f}{\partial x^i} + \frac{dp_i}{d\eta} \frac{\partial f}{\partial p_i} = 0, \tag{9.7} $$

where $dx^i/d\eta$ and $dp_i/d\eta$ are the derivatives calculated along the geodesics.


Temperature and its transformation properties Let us consider a nearly homogeneous
isotropic universe filled by slightly perturbed thermal radiation. The frequency
(energy) of the photon measured by an observer is equal to the time component
of the photon’s 4-momentum in the comoving local inertial frame of the
observer. Therefore in an arbitrary coordinate system where an observer has
4-velocity $u^\alpha$ and the photon 4-momentum is $p_\alpha$, this frequency can be expressed as
$\omega = p_\alpha u^\alpha$. If the radiation coming to an observer from different directions

$$ l^i \equiv -\frac{p_i}{\left( \sum p_i^2 \right)^{1/2}} $$

has a Planckian spectrum, then the distribution function is

$$ f = \bar f\left( \frac{\omega}{T} \right) \equiv \frac{2}{\exp\left( \omega / T(x^\alpha, l^i) \right) - 1}. \tag{9.8} $$

The effective temperature $T(x^\alpha, l^i)$ depends not only on the direction $l^i$ but also
on the observer’s location $x^i$ and on the moment of time $\eta$. The factor of 2 in the
numerator accounts for the two possible polarizations of the photons.

In a nearly isotropic universe, this temperature can be written as

$$ T(x^\alpha, l^i) = T_0(\eta) + \delta T(x^\alpha, l^i), \tag{9.9} $$

where $\delta T \ll T_0$. To understand how the fluctuations $\delta T$ depend on the coordinate
system, let us consider two observers $O$ and $\tilde O$, who are at rest with respect to two
different frames related by the coordinate transformation $\tilde x^\alpha = x^\alpha + \xi^\alpha$. In the rest
frame of each observer, the zeroth component of 4-velocity can be expressed through
the metric by using the relation $g_{\alpha\beta} u^\alpha u^\beta = g_{00} (u^0)^2 = 1$. Thus we conclude that
the frequency of the same photon differs as measured by different observers. The
two frequencies are equal to

$$ \omega = p_\alpha u^\alpha = p_0 / \sqrt{g_{00}} \quad \text{and} \quad \tilde\omega = \tilde p_0 / \sqrt{\tilde g_{00}} $$

respectively, where the photon’s momentum and metric components in different
frames are related by coordinate transformation laws. Using these laws together
with the relation $p_\alpha p^\alpha = 0$ we get

$$ \tilde\omega = \omega \left( 1 + \frac{\partial\xi^i}{\partial\eta} l^i \right), \tag{9.10} $$

where we have kept only the first order terms in $\xi^\alpha$ and in the metric perturbations
around the homogeneous isotropic universe. Since the distribution function (9.8) is
a scalar,

$$ \omega / T(x^\alpha) = \tilde\omega / \tilde T(\tilde x^\alpha), $$

and hence the temperature fluctuations measured by two observers are related as

$$ \widetilde{\delta T} = \delta T - T_0' \xi^0 + T_0 \frac{\partial\xi^i}{\partial\eta} l^i. \tag{9.11} $$

We can see from this expression that the monopole ($l^i$-independent) and dipole
(proportional to $l^i$) components of the temperature fluctuations depend on the particular
coordinate system in which an observer is at rest. If we can only observe the
radiation from one vantage point, then the monopole term can always be removed 
by redefinition of the background temperature. The dipole component depends on 
the motion of the observer with respect to the “preferred frame” determined by 
the background radiation. For these reasons, neither the monopole nor the dipole 
components are very informative about the initial fluctuations. We have to look to 
the quadrupole and higher-order multipoles instead, which do not depend on the 
motion of the particular observer and coordinate system we use to calculate them. 


9.2 Sachs-Wolfe effect


In this section we will solve the Boltzmann equation for freely propagating radiation
in a flat universe using the conformal Newtonian coordinate system where the metric
takes the form

$$ ds^2 = a^2 \left\{ (1 + 2\Phi) \, d\eta^2 - (1 - 2\Phi) \, \delta_{ik} dx^i dx^k \right\}. \tag{9.12} $$

Here $\Phi \ll 1$ is the gravitational potential of the scalar metric perturbations. We
discount gravitational waves for the moment and consider them later. 


Geodesics The geodesic equations describing the propagation of the radiation in
an arbitrary curved spacetime can be rewritten as (see Problem 2.13)

$$ \frac{dx^\alpha}{d\lambda} = p^\alpha, \qquad \frac{dp_\alpha}{d\lambda} = \frac{1}{2} \frac{\partial g_{\gamma\delta}}{\partial x^\alpha} p^\gamma p^\delta, \tag{9.13} $$

where $\lambda$ is an affine parameter along the geodesic. Since the photons have zero
mass, the first integral of these equations is $p^\alpha p_\alpha = 0$. Using this relation, we can
express the time component of the photon 4-momentum in terms of the spatial
components. Up to first order in the metric perturbations, one obtains

$$ p^0 = \frac{1}{a^2} \left( \sum p_i^2 \right)^{1/2} \equiv \frac{p}{a^2}, \qquad p_0 = (1 + 2\Phi) \, p. \tag{9.14} $$

Then, from the first equation in (9.13) we can see that

$$ \frac{dx^i}{d\eta} = \frac{p^i}{p^0} = l^i (1 + 2\Phi), \tag{9.15} $$

where $l^i \equiv -p_i/p$. Expressing $p^0$ and $p^i$ in terms of $p_i$ and substituting the metric
(9.12) into the second equation (9.13), we obtain

$$ \frac{dp_\alpha}{d\eta} = \frac{1}{2} \frac{\partial g_{\gamma\delta}}{\partial x^\alpha} \frac{p^\gamma p^\delta}{p^0} = 2p \frac{\partial\Phi}{\partial x^\alpha}. \tag{9.16} $$
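
The step from (9.13) to (9.16) can be checked explicitly for the spatial components. The
sketch below uses only the metric (9.12) and the relations (9.14), keeps terms to first
order in $\Phi$, and uses $d\lambda = d\eta / p^0$, which follows from the first equation in (9.13).

```latex
\begin{aligned}
\frac{\partial g_{00}}{\partial x^j} &= 2a^2 \frac{\partial\Phi}{\partial x^j}, \qquad
\frac{\partial g_{ik}}{\partial x^j} = 2a^2 \delta_{ik} \frac{\partial\Phi}{\partial x^j}, \\
\frac{dp_j}{d\eta} &= \frac{1}{2p^0} \frac{\partial g_{\gamma\delta}}{\partial x^j} p^\gamma p^\delta
 = \frac{a^2}{p^0} \frac{\partial\Phi}{\partial x^j}
   \left[ (p^0)^2 + \sum_i (p^i)^2 \right] \\
 &\simeq \frac{a^2}{p^0} \frac{\partial\Phi}{\partial x^j} \cdot \frac{2p^2}{a^4}
 = 2p \frac{\partial\Phi}{\partial x^j}.
\end{aligned}
```

In the last line $p^0 = p/a^2$ and $\sum_i (p^i)^2 \simeq p^2/a^4$ are used at zeroth order, since
the prefactor $\partial\Phi/\partial x^j$ is already first order.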


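Equation for the temperature fluctuations Using the geodesic equations (9.15) and
(9.16), the Boltzmann equation (9.7) takes the form

$$ \frac{\partial f}{\partial\eta} + l^i (1 + 2\Phi) \frac{\partial f}{\partial x^i} + 2p \frac{\partial\Phi}{\partial x^j} \frac{\partial f}{\partial p_j} = 0. \tag{9.17} $$

Since $f$ is a function of the single variable

$$ y \equiv \frac{\omega}{T} = \frac{p_0}{T \sqrt{g_{00}}} \simeq \frac{p}{T_0 a} \left( 1 + \Phi - \frac{\delta T}{T_0} \right), \tag{9.18} $$

the Boltzmann equation to zeroth order in the perturbations reduces to

$$ (T_0 a)' = 0, \tag{9.19} $$

and to linear order becomes

$$ \left( \frac{\partial}{\partial\eta} + l^i \frac{\partial}{\partial x^i} \right) \left( \frac{\delta T}{T} + \Phi \right) = 2 \frac{\partial\Phi}{\partial\eta}. \tag{9.20} $$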
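
The linear-order equation can be recovered in a few lines. The sketch below substitutes
$f = \bar f(y)$ with $y$ from (9.18) into (9.17), uses $(T_0 a)' = 0$ from (9.19), and drops
second-order terms; the overall factor $\bar f'(y) \, p/(T_0 a)$ is common to all terms and has
been divided out.

```latex
\begin{aligned}
\frac{\partial y}{\partial\eta} &\simeq \frac{p}{T_0 a}\,
   \frac{\partial}{\partial\eta}\left(\Phi - \frac{\delta T}{T}\right), \qquad
l^i \frac{\partial y}{\partial x^i} \simeq \frac{p}{T_0 a}\,
   l^i \frac{\partial}{\partial x^i}\left(\Phi - \frac{\delta T}{T}\right), \\
2p \frac{\partial\Phi}{\partial x^j} \frac{\partial y}{\partial p_j}
 &\simeq 2p \frac{\partial\Phi}{\partial x^j} \frac{p_j}{p\, T_0 a}
 = -\frac{p}{T_0 a}\, 2\, l^j \frac{\partial\Phi}{\partial x^j}, \\
0 &= \left(\frac{\partial}{\partial\eta} + l^i \frac{\partial}{\partial x^i}\right)
     \left(\Phi - \frac{\delta T}{T}\right) - 2\, l^i \frac{\partial\Phi}{\partial x^i}
 \;\Longrightarrow\;
 \left(\frac{\partial}{\partial\eta} + l^i \frac{\partial}{\partial x^i}\right)
 \left(\frac{\delta T}{T} + \Phi\right) = 2 \frac{\partial\Phi}{\partial\eta}.
\end{aligned}
```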


Solutions The zeroth order equation (9.19) informs us that the temperature of the
background radiation in a homogeneous universe is inversely proportional to the
scale factor, while (9.20) determines the temperature fluctuations of the microwave
background. In the case of practical interest, the universe is matter-dominated after
recombination and the main mode in $\Phi$ is constant. Therefore, the right hand side
of (9.20) vanishes. The operator on the left hand side is just a total time derivative
and we obtain the result that

$$ \left( \frac{\delta T}{T} + \Phi \right) = const, \tag{9.21} $$

along null geodesics. The influence of the gravitational potential on the microwave
background fluctuations is known as the Sachs-Wolfe effect.

In reality, radiation is a small but not completely negligible fraction of the energy
density immediately after recombination, and so $\Phi$ is actually slowly time-varying.
Consequently, the combination $(\delta T/T + \Phi)$, according to (9.20), varies
by an amount proportional to the integral of $\partial\Phi/\partial\eta$ along the geodesic of the photon.
The contribution to the temperature fluctuations induced by the change of the
gravitational potential due to the residual radiation after equality is called the early
integrated Sachs-Wolfe effect. When dark energy, either quintessence or vacuum
density, overtakes the matter density in more recent epochs, the gravitational potential
begins to vary once again, causing a further contribution to the temperature
fluctuations. This phenomenon is referred to as the late integrated Sachs-Wolfe effect.
To simplify the final formulae, we neglect both integrated Sachs-Wolfe effects
which in any case never contribute more than 10-20% to the resulting amplitudes
of the fluctuations. The reader can easily generalize the formulae derived below to
include these effects.


Free-streaming Let us consider an initial distribution of free photons with temperature
$T + \delta T$ where

$$ \frac{\delta T}{T}(\eta_i, \mathbf{x}, \mathbf{l}) = A_{\mathbf{k}} \sin(\mathbf{k}\mathbf{x}) \, g(\mathbf{l}). \tag{9.22} $$

In this case, the radiation energy density is inhomogeneous and its spatial variation
is proportional to

$$ \left\langle \frac{\delta T}{T} \right\rangle_{\mathbf{l}} = A_{\mathbf{k}} \sin(\mathbf{k}\mathbf{x}) \, \langle g \rangle_{\mathbf{l}}, $$

where $\langle \, \rangle_{\mathbf{l}}$ denotes the average over directions $\mathbf{l}$.


Problem 9.2 Neglecting the gravitational potential $\Phi$ in (9.20), show that at later
times

$$ \left\langle \frac{\delta T}{T}(\eta, \mathbf{x}) \right\rangle_{\mathbf{l}} = \frac{\sin(k(\eta - \eta_i))}{k(\eta - \eta_i)} \left\langle \frac{\delta T}{T}(\eta_i, \mathbf{x}) \right\rangle_{\mathbf{l}}. \tag{9.23} $$

Thus, at $\eta \gg \eta_i$, the initial spatial inhomogeneities of the energy density of free
photons will be suppressed by the ratio of the inhomogeneity scale to the Hubble
radius. This damping effect is known as free-streaming. The suppression occurs
as a result of the mixing of photons arriving at a given point from regions with
different temperatures.

Unlike Silk damping, free-streaming causes the spatial variation (the $\mathbf{x}$-dependence)
of the photon distribution to decrease as a power law rather than
exponentially with $k$. Furthermore, free-streaming does not make the distribution
function isotropic. Although the spatial variation (the $\mathbf{x}$-dependence) of the photon
distribution is damped, the initial angular anisotropy (the $\mathbf{l}$-dependence) is preserved.
Note that the perturbations on superhorizon scales are unaffected because
the photons have no chance to mix.
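
The damping factor in (9.23) can be checked numerically in the simplest case. The Python
sketch below assumes an isotropic initial distribution, $g(\mathbf{l}) = 1$, a plane wave along
one axis, and photons that reach $\mathbf{x}$ at time $\eta$ from $\mathbf{x} - \mathbf{l}(\eta - \eta_i)$; the function
names, the midpoint-rule average and the sample numbers are illustrative assumptions, not
part of the original text.

```python
import math


def direction_averaged_fluctuation(k, x, elapsed, amplitude=1.0, samples=20000):
    """Average A sin(k (x - mu * elapsed)) over mu = cos(angle), uniform on [-1, 1]."""
    total = 0.0
    for n in range(samples):
        mu = -1.0 + (2.0 * n + 1.0) / samples  # midpoint of the n-th sub-interval
        total += amplitude * math.sin(k * (x - mu * elapsed))
    return total / samples


def free_streaming_factor(k, elapsed):
    """Damping factor sin(k * elapsed) / (k * elapsed) appearing in (9.23)."""
    phase = k * elapsed
    return 1.0 if phase == 0.0 else math.sin(phase) / phase


if __name__ == "__main__":
    k, x, elapsed = 2.0, 0.3, 3.5  # illustrative values only
    numeric = direction_averaged_fluctuation(k, x, elapsed)
    analytic = math.sin(k * x) * free_streaming_factor(k, elapsed)
    print(numeric, analytic)
```

The envelope of the factor falls off as $1/(k(\eta - \eta_i))$, which is the power-law
suppression contrasted with Silk damping above.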


9.3 Initial conditions 


As follows from (9.15), the photons arriving from direction $l^i$ seen at the present
time $\eta_0$ by an observer located at $x_0^i$, propagate along geodesics

$$ x^i(\eta) \simeq x_0^i + l^i (\eta - \eta_0). \tag{9.24} $$

Therefore, from (9.21), we find that $\delta T/T$ in the direction $l^i$ on the sky today is
equal to

$$ \frac{\delta T}{T}(\eta_0, x_0^i, l^i) = \frac{\delta T}{T}(\eta_r, x^i(\eta_r), l^i) + \Phi(\eta_r, x^i(\eta_r)) - \Phi(\eta_0, x_0^i), \tag{9.25} $$

where $\eta_r$ is the conformal time of recombination and $x^i(\eta_r)$ is given by (9.24). Since
we can observe from only one vantage point in the universe, we are only interested in
the $l^i$-dependence of the temperature fluctuations. Hence the last term, which only
contributes to the monopole component, can be ignored. The angular dependence
of $(\delta T/T)_0$ is set by two contributions: (a) the “initial” temperature fluctuations
on the last scattering surface; and (b) the value of the gravitational potential $\Phi$ at
this same location. The first contribution, $(\delta T/T)_r$, can be expressed in terms of
the gravitational potential and the fluctuations of the photon energy density
$\delta_\gamma \equiv \delta\varepsilon_\gamma/\varepsilon_\gamma$ on the last scattering surface. For this purpose, we use matching conditions
for the hydrodynamic energy-momentum tensor, which describes the radiation
before decoupling, and the kinetic energy-momentum tensor, which characterizes
the gas of free photons after decoupling:

$$ T^\alpha_\beta = \frac{1}{\sqrt{-g}} \int f \frac{p^\alpha p_\beta}{p^0} d^3p. \tag{9.26} $$

Substituting the metric (9.12) into (9.26) and assuming a Planckian distribution
(9.8), we get an expression to linear order for the 0-0 component of the kinetic
energy-momentum tensor

$$ T^0_0 = \frac{1}{\sqrt{-g}} \int \bar f\left( \frac{\omega}{T} \right) p_0 \, d^3p \simeq T_0^4 \int \left( 1 + 4 \frac{\delta T}{T_0} \right) \bar f(y) \, y^3 \, dy \, d^2l, \tag{9.27} $$

where $y \equiv \omega/T$ and $p_0$ and $p$ have been expressed in terms of $\omega$ using (9.14)
and (9.18). The integral over $y$ can be calculated explicitly and simply gives a
numerical factor that, combined with $4\pi T_0^4$, represents the energy density of the
unperturbed radiation. This expression, describing the gas of photons immediately
after recombination, should continuously match the 0-0 component of the
hydrodynamic energy-momentum tensor which characterizes radiation before
recombination: $T^0_0 = \varepsilon_\gamma (1 + \delta_\gamma)$. The matching condition implies

$$ \delta_\gamma = 4 \int \frac{\delta T}{T} \frac{d^2l}{4\pi}. \tag{9.28} $$

Similarly, one can derive from (9.26) that the other components of the kinetic
energy-momentum tensor are:

$$ T^i_0 = 4 \varepsilon_\gamma \int \frac{\delta T}{T} \, l^i \, \frac{d^2l}{4\pi}. \tag{9.29} $$

Taking the divergence of this expression and comparing it to the divergence of
the hydrodynamical energy-momentum tensor for radiation before recombination,
which is given by (7.117), we get the second matching condition

$$ \delta_\gamma' = -4 \int l^i \nabla_i \left( \frac{\delta T}{T} \right) \frac{d^2l}{4\pi}, \tag{9.30} $$

where we have neglected the radiation contribution to the gravitational potential
and therefore set $\Phi'(\eta_r) = 0$.

It is straightforward to show that to satisfy both (9.28) and (9.30) the spatial 
Fourier component of the temperature fluctuations should be related to the energy 
density inhomogeneities in radiation as 


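$$ \left( \frac{\delta T}{T} \right)_{\mathbf{k}} (\mathbf{l}, \eta_r) = \frac{1}{4} \left( \delta_k + \frac{3i}{k^2} (k_m l^m) \, \delta_k' \right). \tag{9.31} $$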
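
Both matching conditions can be verified directly. The sketch below assumes a single
Fourier mode, so that $\nabla_i \to i k_i$, and uses the standard angular averages
$\int l^i \, d^2l/4\pi = 0$ and $\int l^i l^m \, d^2l/4\pi = \delta^{im}/3$.

```latex
\begin{aligned}
4 \int \left(\frac{\delta T}{T}\right)_{\mathbf{k}} \frac{d^2l}{4\pi}
 &= \delta_k + \frac{3i}{k^2}\, k_m\, \delta_k' \int l^m \frac{d^2l}{4\pi} = \delta_k, \\
-4 \int l^i\, (i k_i) \left(\frac{\delta T}{T}\right)_{\mathbf{k}} \frac{d^2l}{4\pi}
 &= -i k_i \left( \delta_k \int l^i \frac{d^2l}{4\pi}
    + \frac{3i}{k^2}\, k_m\, \delta_k' \int l^i l^m \frac{d^2l}{4\pi} \right)
  = \frac{3}{k^2}\, \frac{k_i k_m \delta^{im}}{3}\, \delta_k' = \delta_k' .
\end{aligned}
```

The first line reproduces (9.28) and the second reproduces (9.30), so the ansatz with a
monopole and a dipole term is sufficient.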


Here and throughout this chapter we drop the subscript $\gamma$, keeping in mind that
the notation $\delta$ is always used for the fractional energy fluctuations in the radiation
component itself. Substituting (9.31) in the Fourier expansion of (9.25), we obtain
the final expression for the temperature fluctuations in the direction $\mathbf{l} \equiv (l^1, l^2, l^3)$
as observed at location $\mathbf{x}_0 \equiv (x^1, x^2, x^3)$:

$$ \frac{\delta T}{T}(\eta_0, \mathbf{x}_0, \mathbf{l}) = \int \left[ \left( \Phi + \frac{\delta}{4} \right)_{\mathbf{k}} - \frac{3 \delta_{\mathbf{k}}'}{4k^2} \frac{\partial}{\partial\eta_0} \right]_{\eta_r} e^{i\mathbf{k}\cdot(\mathbf{x}_0 + \mathbf{l}(\eta_r - \eta_0))} \frac{d^3k}{(2\pi)^{3/2}}, \tag{9.32} $$

where $k \equiv |\mathbf{k}|$, $\mathbf{k}\cdot\mathbf{l} \equiv k_m l^m$ and $\mathbf{k}\cdot\mathbf{x}_0 \equiv k_n x_0^n$. Because $\eta_r/\eta_0$ is less than 1/30,
we can neglect $\eta_r$ in favor of $\eta_0$ in this expression. The first term in square brackets
represents the combined result from the initial inhomogeneities in the radiation
energy density itself and the Sachs-Wolfe effect, and the second term is related
to the velocities of the baryon-radiation plasma at recombination. The latter term
is therefore often referred to in the literature as the Doppler contribution to the
fluctuations. The characteristic peaks in the temperature anisotropy power spectrum
(described below) are sometimes called “Doppler peaks,” but we will see that the
Doppler term is not the dominant cause of these peaks.


9.4 Correlation function and multipoles 


A sky map of the cosmic microwave background temperature fluctuations can be 
fully characterized in terms of an infinite sequence of correlation functions. If the 
spectrum of fluctuations is Gaussian, as predicted by inflation and as current data 
suggest, then only the even order correlation functions are nonzero and all of them 
can be directly expressed through the two-point correlation function (also known 
as the temperature autocorrelation function): 


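$$ C(\theta) \equiv \left\langle \frac{\delta T}{T_0}(\mathbf{l}_1) \, \frac{\delta T}{T_0}(\mathbf{l}_2) \right\rangle, \tag{9.33} $$

where the brackets $\langle \, \rangle$ denote averaging over all directions $\mathbf{l}_1$ and $\mathbf{l}_2$ satisfying
the condition $\mathbf{l}_1 \cdot \mathbf{l}_2 = \cos\theta$. The squared temperature difference between two
directions separated by angle $\theta$, averaged over the sky, is related to $C(\theta)$ by

$$ \left\langle \left( \frac{\delta T}{T_0}(\theta) \right)^2 \right\rangle \equiv \left\langle \left( \frac{T(\mathbf{l}_1) - T(\mathbf{l}_2)}{T_0} \right)^2 \right\rangle = 2 \left( C(0) - C(\theta) \right). \tag{9.34} $$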
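
The factor of 2 and the appearance of $C(0)$ follow from expanding the square. The sketch
assumes, as in the text, that the sky average is statistically the same for both
directions, so that the mean square of $\delta T/T_0$ in either direction equals $C(0)$.

```latex
\left\langle \left( \frac{T(\mathbf{l}_1) - T(\mathbf{l}_2)}{T_0} \right)^2 \right\rangle
 = \left\langle \left(\frac{\delta T}{T_0}\right)^{2}\!(\mathbf{l}_1) \right\rangle
 + \left\langle \left(\frac{\delta T}{T_0}\right)^{2}\!(\mathbf{l}_2) \right\rangle
 - 2 \left\langle \frac{\delta T}{T_0}(\mathbf{l}_1)\, \frac{\delta T}{T_0}(\mathbf{l}_2) \right\rangle
 = 2\,C(0) - 2\,C(\theta).
```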





The temperature autocorrelation function is a detailed fingerprint that can be used 
first to discriminate among cosmological models and then, once the model is fixed, 
to determine the values of its fundamental parameters. The three-point function, 
also known as the bispectrum, is a sensitive test for a non-Gaussian contribution to 
the fluctuation spectrum since it is precisely zero in the Gaussian limit. 

The cosmic microwave background is also polarized. An expanded set of n- 
point correlation functions can be constructed which quantifies the correlation of 
the polarization over long distances and the correlation between polarization and 
temperature fluctuations. For the purpose of this primer, however, we focus on com- 
puting the temperature autocorrelation function, since this has proven to be the most 
useful to date. The generalization to other correlation functions is straightforward. 

The universe is homogeneous and isotropic on large scales. Consequently, 
averaging over all directions on the sky from a single vantage point (e.g., the 
Earth) should be close to the average of the results obtained by other observers in 
many points in space for given directions. The latter average corresponds to the 
cosmic mean and is determined by correlation functions of the random field of 
inhomogeneities. The root-mean-square difference between a local measurement
and the cosmic mean is known as cosmic variance. This difference is due to the
poorer statistics of a single observer and depends on the number of appropriate
representatives of the random inhomogeneities within a horizon. The variance is
tiny at small angular scales but substantial for angular separations of 10 degrees or 
more. 

Cosmic variance is an unavoidable uncertainty, but experiments usually introduce 
additional uncertainty by measuring only a finite fraction of the full sky. The total 
difference from the cosmic mean, called sample variance, is inversely proportional 
to the square root of the fractional area measured, approaching cosmic variance as
the measured area approaches the full sky.

Because of the homogeneous isotropic nature of the random fluctuations one
can calculate the cosmic mean of the angular correlation function $C(\theta)$ by simply
averaging over the observer position $\mathbf{x}_0$ and keeping the directions $\mathbf{l}_1$ and $\mathbf{l}_2$ fixed.
For the Gaussian field, this is equivalent to an ensemble average of the appropriate
random Fourier components, $\langle\Phi_{\mathbf{k}}\Phi_{\mathbf{k}'}\rangle = |\Phi_k|^2\,\delta(\mathbf{k} + \mathbf{k}')$, etc. Keeping this remark
in mind and substituting (9.32) into (9.33), after integration over the angular part
of $\mathbf{k}$, the cosmic mean of the temperature autocorrelation function can be written as


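$$
C(\theta) = \int \left(\Phi_k + \frac{\delta_k}{4} - \frac{3\delta'_k}{4k^2}\frac{\partial}{\partial\eta_1}\right)\left(\Phi_k + \frac{\delta_k}{4} - \frac{3\delta'_k}{4k^2}\frac{\partial}{\partial\eta_2}\right)^{*}\frac{\sin(k|\mathbf{l}_1\eta_1 - \mathbf{l}_2\eta_2|)}{k|\mathbf{l}_1\eta_1 - \mathbf{l}_2\eta_2|}\,\frac{k^2\,dk}{2\pi^2}, \tag{9.35}
$$

where after differentiation with respect to $\eta_1$ and $\eta_2$, we set $\eta_1 = \eta_2 = \eta_0$. Using
the well known expansion

$$
\frac{\sin(k|\mathbf{l}_1\eta_1 - \mathbf{l}_2\eta_2|)}{k|\mathbf{l}_1\eta_1 - \mathbf{l}_2\eta_2|} = \sum_{l=0}^{\infty}(2l+1)\,j_l(k\eta_1)\,j_l(k\eta_2)\,P_l(\cos\theta), \tag{9.36}
$$

where $P_l(\cos\theta)$ and $j_l(k\eta)$ are the Legendre polynomials and spherical Bessel
functions of order $l$, respectively, we can rewrite the expression for $C(\theta)$ as a
discrete sum over multipole moments $C_l$: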


$$
C(\theta) = \frac{1}{4\pi}\sum_{l=2}^{\infty}(2l+1)\,C_l\,P_l(\cos\theta). \tag{9.37}
$$

The monopole and dipole components ($l = 0, 1$) have been excluded here and

$$
C_l = \frac{2}{\pi}\int\left|\left(\Phi_k(\eta_r) + \frac{\delta_k(\eta_r)}{4}\right)j_l(k\eta_0) - \frac{3\delta'_k(\eta_r)}{4k}\,\frac{dj_l(k\eta_0)}{d(k\eta_0)}\right|^2 k^2\,dk. \tag{9.38}
$$

If $\delta T/T$ is expanded in terms of spherical harmonics,

$$
\frac{\delta T}{T_0}(\theta, \phi) = \sum_{l,m} a_{lm}\,Y_{lm}(\theta, \phi), \tag{9.39}
$$

then the complex coefficients $a_{lm}$, in a homogeneous and isotropic universe, satisfy
the condition

$$
\langle a^{*}_{l'm'}\,a_{lm}\rangle = \delta_{ll'}\,\delta_{mm'}\,C_l, \tag{9.40}
$$

where the brackets refer to a cosmic mean. The multipole moments, $C_l = \langle|a_{lm}|^2\rangle$,
receive their main contribution from fluctuations on angular scale $\theta \sim \pi/l$ and
$l(l+1)\,C_l$ is about typical squared temperature fluctuations on this scale.
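
As an illustration of the multipole expansion (9.37), the following Python sketch sums the Legendre series for a given set of multipole moments. The function names, the truncation at l = 50, and the toy input (a flat spectrum with l(l+1)C_l = 1) are assumptions made for the example and are not taken from the text.

```python
import math

def legendre_values(l_max, x):
    """Return [P_0(x), ..., P_lmax(x)] using Bonnet's recurrence."""
    p = [1.0, x]
    for l in range(1, l_max):
        p.append(((2 * l + 1) * x * p[l] - l * p[l - 1]) / (l + 1))
    return p[: l_max + 1]

def correlation(theta, c_l):
    """Evaluate (9.37): C(theta) = (1/4pi) sum_{l>=2} (2l+1) C_l P_l(cos theta).

    c_l is a dict {l: C_l}; monopole and dipole entries are ignored.
    """
    l_max = max(c_l)
    p = legendre_values(l_max, math.cos(theta))
    total = sum((2 * l + 1) * c * p[l] for l, c in c_l.items() if l >= 2)
    return total / (4.0 * math.pi)

# Toy input (assumption): flat plateau l(l+1) C_l = 1, truncated at l = 50.
toy = {l: 1.0 / (l * (l + 1)) for l in range(2, 51)}
c_zero = correlation(0.0, toy)
c_ten_deg = correlation(math.radians(10.0), toy)
# Mean squared temperature difference at 10 degrees, from (9.34).
mean_sq_diff = 2.0 * (c_zero - c_ten_deg)
```

Since every $C_l$ is positive and $|P_l| \le 1$, the sum is largest at zero separation, so the squared difference in (9.34) is never negative.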


Problem 9.3 Generalize the formula (9.38) for $\Phi'_k(\eta_r) \neq 0$, thereby incorporating
the integrated Sachs-Wolfe effect.


Inflation predicts a flat universe with a nearly scale-invariant, adiabatic spectrum 
of Gaussian fluctuations. As we shall show, these lead to certain qualitative features 
in the temperature anisotropy power spectrum: a flat plateau for large angular scales, 
a sequence of peaks and valleys with a first peak at $l \approx 200$, and a steady damping
of the oscillation amplitude as $l$ increases. Once these features are confirmed, then
a precise measurement of the power spectrum can be used to constrain many of
the cosmological parameters which inflation does not fix uniquely. First, there are
the amplitude $B$ and spectral index $n_s$ of the primordial density inhomogeneities
generated by inflation. The rather generic prediction of inflation is that $|\Phi_k^2 k^3| = B k^{n_s - 1}$,
with $1 - n_s \sim 0.03$ to $0.08$. The amplitude $B$ is not predicted by inflation. Its
value is chosen to fit the observations. The other parameters involved in defining the
shape of the temperature power spectrum are the Hubble constant $h_{75}$, the fraction
of the critical density today due to the baryon density $\Omega_b$, the total matter (baryonic
plus cold dark matter) density $\Omega_m$, and the vacuum (or quintessence) energy density
$\Omega_\Lambda$.

The present data are consistent with inflation, and suggest a flat universe that 
consists approximately of 5% baryonic matter, 25% cold dark matter, and 70% dark 
energy. We will take these values for our fiducial model, also called the concordance 
model, and compute the temperature fluctuation spectrum for a range of parameters 
around this model. 


9.5 Anisotropies on large angular scales 


The fluctuations on large angular scales ($\theta \gg 1^\circ$) are induced by inhomogeneities
with wavelengths which exceed the Hubble radius at recombination and have not
had a chance to evolve significantly since the end of inflation. Thus their spectrum
represents pristine information about the primordial inhomogeneities. In this section
we show that for perturbations predicted by inflation, the spectrum of temperature 
fluctuations on large angular scales has a nearly flat plateau, the height and slope of 
which are mainly determined by the amplitude and spectral index characterizing the 
primordial inhomogeneities and practically independent of the other cosmological 
parameters. 

The Hubble scale at recombination $H^{-1} = 3t_r/2$ spans $0.87^\circ$ on the sky today
(see (2.73)). Therefore, the results derived in this section refer to the angles $\theta \gg 1^\circ$,
or to the multipoles $l \ll \pi/\theta_H \sim 200$.

As shown in Section 7.4, for adiabatic perturbations with $k\eta_r \ll 1$, the relative
energy density fluctuations in the radiation component itself can be expressed
through the gravitational potential as

$$
\delta_k(\eta_r) \simeq -\frac{8}{3}\,\Phi_k(\eta_r), \qquad \delta'_k(\eta_r) \simeq 0. \tag{9.41}
$$

According to (9.32), the resulting temperature fluctuation due to large-scale
inhomogeneities is

$$
\frac{\delta T}{T}(\eta_0, \mathbf{x}_0, \mathbf{l}) \simeq \frac{1}{3}\,\Phi(\eta_r, \mathbf{x}_0 - \mathbf{l}\eta_0). \tag{9.42}
$$

That is, the fluctuation amplitude is equal to one third of the gravitational potential
at the point on the last scattering surface from which the photons emanated. In this
estimate, we neglect the contribution of radiation to the gravitational potential at
recombination and both integrated Sachs-Wolfe effects, which are subdominant.

After matter-radiation equality, the potential on superhorizon scales drops by a
factor of 9/10. Taking this into account, substituting (9.41) into (9.38), and
calculating the integral with the help of the identity

$$
\int_0^\infty s^{m-1}\,j_l^2(s)\,ds = 2^{m-3}\,\pi\,\frac{\Gamma(2 - m)\,\Gamma\!\left(l + \frac{m}{2}\right)}{\Gamma^2\!\left(\frac{3 - m}{2}\right)\Gamma\!\left(l + 2 - \frac{m}{2}\right)}, \tag{9.43}
$$

we find for a scale-invariant initial spectrum with $|(\Phi_k^0)^2 k^3| = B$, the plateau:

$$
l(l+1)\,C_l \simeq \frac{9B}{100\pi} = \text{const}, \tag{9.44}
$$

on large angular scales or for $l \ll 200$. Since the main contribution to large
angular scales comes from superhorizon inhomogeneities, we have neglected here
the modification of the spectrum for subhorizon modes. In actuality, each $C_l$ is a
weighted integral over all $k$, including near horizon and subhorizon scales, where
the fluctuation amplitude rises and falls. The above result is a good approximation
for $l$ up to 20 or so. For $l > 20$, the neglected effects become essential, leading
first to the rise in amplitude of the temperature fluctuations and then to the acoustic
peaks.
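
The plateau (9.44) can be checked numerically. The sketch below evaluates the right-hand side of the identity (9.43) with gamma functions and inserts it into (9.38), taking the source term to be one third of the potential after its 9/10 drop. The function names are illustrative assumptions; the scale-invariant case corresponds to m = 0, since the integrand reduces to $B\,j_l^2(s)/s$.

```python
import math

def bessel_moment(l, m):
    """Right-hand side of (9.43): integral of s^(m-1) j_l(s)^2 from 0 to infinity."""
    numerator = 2.0 ** (m - 3) * math.pi * math.gamma(2 - m) * math.gamma(l + m / 2)
    denominator = math.gamma((3 - m) / 2) ** 2 * math.gamma(l + 2 - m / 2)
    return numerator / denominator

def plateau(l, B=1.0):
    """l(l+1) C_l for a scale-invariant spectrum with |Phi0^2 k^3| = B.

    The source term is Phi/3 with Phi = (9/10) Phi0, so its square
    contributes a factor (3/10)^2 = 9/100.
    """
    c_l = (2.0 / math.pi) * (9.0 / 100.0) * B * bessel_moment(l, 0)
    return l * (l + 1) * c_l

heights = [plateau(l) for l in (2, 5, 10, 20)]
```

For m = 0 the moment reduces to $1/(2l(l+1))$, which is why $l(l+1)\,C_l$ comes out independent of $l$. The gamma functions overflow for $l$ above roughly 170, so this direct form is only meant for the low multipoles where the plateau applies.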


Problem 9.4 Find the correction to (9.44) if the initial spectrum is not
scale-invariant, $|(\Phi_k^0)^2 k^3| = B k^{n_s - 1}$, assuming that $|n_s - 1| \ll 1$.


Problem 9.5 Determine how (9.42) is modified for the entropy perturbations con- 
sidered in Section 7.3. 


Unfortunately, the information about statistical properties of the primordial
spectrum gathered from a single vantage point is limited by cosmic variance. Since there
are only $2l + 1$ independent $a_{lm}$, the variance is

$$
\frac{\Delta C_l}{C_l} \simeq (2l + 1)^{-1/2}. \tag{9.45}
$$

The typical fluctuation is about 50% for the quadrupole ($l = 2$) and 15% for $l \sim 20$.
Therefore, we are forced to go to smaller angular scales to obtain precise constraints 
on the spectrum of primordial inhomogeneities. The bad news is that, for these 
scales, we can no longer ignore evolution. On the other hand, if we can deconvolve 
the effects of evolution, we gain information about both the primordial spectrum 
and the parameters that control cosmic evolution. 
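
The quoted percentages follow directly from (9.45). A minimal sketch is given below; the helper for partial sky coverage is a hypothetical addition based only on the earlier statement that sample variance scales inversely with the square root of the fractional area measured.

```python
def cosmic_variance(l):
    """Fractional uncertainty Delta C_l / C_l from (9.45)."""
    return (2 * l + 1) ** -0.5

def sample_variance(l, f_sky):
    """Cosmic variance inflated for a sky fraction f_sky in (0, 1] (hypothetical helper)."""
    if not 0.0 < f_sky <= 1.0:
        raise ValueError("f_sky must lie in (0, 1]")
    return cosmic_variance(l) / f_sky ** 0.5

quadrupole = cosmic_variance(2)    # roughly 0.45
l_twenty = cosmic_variance(20)     # roughly 0.16
```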


9.6 Delayed recombination and the finite thickness effect 


On small angular scales, recombination can no longer be approximated as instanta- 
neous. The finite duration of recombination introduces uncertainty as to the precise 
moment and position when a given photon last scatters. As a result, photons arriving 
from a given direction yield only “smeared out” information. In turn this leads to a 
suppression of the temperature fluctuations on small angular scales known as the 
finite thickness effect. The spread in the time of last scattering also increases the 
Silk damping scale, changing the conditions in the region from which the photon 
last scatters. 

We first consider the finite thickness effect. A photon arriving from direction $\mathbf{l}$
might last scatter at any value of the redshift in the interval $1200 > z > 900$. If the
last scattering occurs at conformal time $\eta_L$, the photon carries information about
conditions at position

$$
\mathbf{x}(\eta_L) = \mathbf{x}_0 + \mathbf{l}\,(\eta_L - \eta_0).
$$

Since the total flux of radiation arriving from direction $\mathbf{l}$ consists of photons that
last scattered over a range of times, the information it carries represents a weighted
average over a scale $\Delta x \sim \Delta\eta_r$, where $\Delta\eta_r$ is roughly the duration of
recombination. Clearly, the contribution of inhomogeneities with scales smaller than $\Delta\eta_r$ to
the temperature fluctuations will be smeared out and therefore strongly suppressed.

Let us calculate the probability that the photon was scattered within the time
interval $\Delta t_L$ at physical time $t_L$ (corresponding to the conformal time $\eta_L$) and then
avoided further scattering until the present time $t_0$. We can divide the time interval
$t_0 > t > t_L$ into $N$ small intervals of duration $\Delta t$, where the $j$th interval begins at
time $t_j = t_L + j\Delta t$ and $N \geq j \geq 1$. The probability is then

$$
\Delta P = \frac{\Delta t_L}{\tau(t_L)}\left(1 - \frac{\Delta t}{\tau(t_1)}\right)\cdots\left(1 - \frac{\Delta t}{\tau(t_j)}\right)\cdots\left(1 - \frac{\Delta t}{\tau(t_N)}\right), \tag{9.46}
$$

where

$$
\tau(t_j) = \frac{1}{\sigma_T\,n_t(t_j)\,X(t_j)}
$$

is the mean free time for Thomson scattering, $n_t$ is the number density of all (bound
and free) electrons and $X$ is the ionization fraction. Taking the limit $N \to \infty$
($\Delta t \to 0$), and converting from physical time $t$ to conformal time $\eta$, we obtain

$$
dP(\eta_L) = \mu'(\eta_L)\exp[-\mu(\eta_L)]\,d\eta_L, \tag{9.47}
$$

where the prime denotes the derivative with respect to conformal time and $\mu(\eta_L)$
is the optical depth:

$$
\mu(\eta_L) = \int_{t_L}^{t_0}\frac{dt}{\tau(t)} = \int_{\eta_L}^{\eta_0}\sigma_T\,n_t\,X\,a(\eta)\,d\eta. \tag{9.48}
$$
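
The passage from the product in (9.46) to the exponential of the optical depth can be illustrated numerically. The sketch below uses a toy mean free time, $\tau(t) = e^{t}$ in arbitrary units, chosen only because its optical depth has a closed form; it is an assumption for illustration and not the physical $\tau$ of Thomson scattering.

```python
import math

def survival_product(tau, t_start, t_end, n):
    """Product of the factors (1 - dt/tau(t_j)), j = 1..n, as in (9.46)."""
    dt = (t_end - t_start) / n
    prob = 1.0
    for j in range(1, n + 1):
        prob *= 1.0 - dt / tau(t_start + j * dt)
    return prob

def toy_tau(t):
    """Toy mean free time (assumption): grows exponentially with time."""
    return math.exp(t)

T_END = 5.0
mu = 1.0 - math.exp(-T_END)          # optical depth: integral of dt/tau from 0 to T_END
exact = math.exp(-mu)                # survival probability in the continuum limit
coarse = survival_product(toy_tau, 0.0, T_END, 10)
fine = survival_product(toy_tau, 0.0, T_END, 100_000)
```

As the number of intervals grows, the product approaches $\exp(-\mu)$, which is the factor multiplying the scattering rate in (9.47).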


The uncertainty in the last scattering time causes us to modify our expression for
the temperature fluctuation (9.32), replacing the recombination moment $\eta_r$ by $\eta_L$
and integrating over $\eta_L$ with the probability weight (9.47):

$$
\frac{\delta T}{T} = \int\left[\Phi + \frac{\delta}{4} - \frac{3\delta'}{4k^2}\frac{\partial}{\partial\eta_0}\right]_{\eta_L} e^{i\mathbf{k}\cdot(\mathbf{x}_0 + \mathbf{l}(\eta_L - \eta_0))}\,\mu' e^{-\mu}\,d\eta_L\,\frac{d^3k}{(2\pi)^{3/2}}. \tag{9.49}
$$

Unlike in (9.32), here we cannot neglect $\eta_L$ compared to $\eta_0$ because for
$k > (\Delta\eta_r)^{-1}$ an oscillating exponential factor in (9.49) changes significantly during the
time interval $\Delta\eta_r$ when the visibility function

$$
\mu'(\eta_L)\exp[-\mu(\eta_L)]
$$

is substantially different from zero. The visibility function vanishes at very small
$\eta_L$ (because $\mu \gg 1$) and at large $\eta_L$ (where $\mu' \to 0$), and reaches its maximum at
$\eta_r$ determined by the condition

$$
\mu'' = \mu'^2. \tag{9.50}
$$


Since recombination is really spread over an interval of time, we will use $\eta_r$
henceforth to represent the conformal time when the visibility function takes its
maximum value. This maximum is located within the narrow redshift range
$1200 > z > 900$. During this short time interval, the scale factor and the total number density
$n_t$ do not vary substantially, so we can use their values at $\eta = \eta_r$. On the other hand,
the ionization fraction $X$ changes by several orders of magnitude over this same
interval of time, so its variation cannot be ignored. Substituting (9.48) in (9.50), we
can re-express the condition determining $\eta_r$ as

$$
X'_r \simeq -(\sigma_T n_t a)_r\,X_r^2, \tag{9.51}
$$

where the subscript $r$ refers to the value at $\eta_r$. For redshifts $1200 > z > 900$, $X$ is
well described by (3.202). The time variation of $X$ is mainly due to the exponential
factor, and hence

$$
X' \simeq -\frac{1.44\times 10^4}{z}\,\mathcal{H}X, \tag{9.52}
$$

where $\mathcal{H} = a'/a$. Substituting this relation in (9.51) we obtain

$$
X_r \simeq \mathcal{H}_r\,\kappa\,(\sigma_T n_t a)_r^{-1}, \tag{9.53}
$$

where $\kappa = 14\,400/z_r$. Together with (3.202), this equation determines that the
visibility function reaches its maximum at $z_r \simeq 1050$, irrespective of the values of
cosmological parameters. At this time, the ionization fraction $X_r$ is $\kappa \simeq 13.7$ times
larger than the ionization fraction at decoupling, as defined by the condition $t \sim \tau_\gamma$
(see (3.206)). Near its maximum, the visibility function can be well approximated
as a Gaussian:


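$$
\mu'\exp(-\mu) \propto \exp\left[-\frac{1}{2}\,(\mu - \ln\mu')''_r\,(\eta_L - \eta_r)^2\right]. \tag{9.54}
$$

Calculating the derivatives with the help of (9.52) and (9.53), we obtain

$$
\mu'\exp(-\mu) \simeq \frac{(\kappa\mathcal{H}\eta)_r}{\sqrt{2\pi}\,\eta_r}\exp\left[-\frac{1}{2}\,(\kappa\mathcal{H}\eta)_r^2\left(\frac{\eta_L}{\eta_r} - 1\right)^2\right], \tag{9.55}
$$

where the prefactor has been chosen to satisfy the normalization condition

$$
\int\mu'\exp(-\mu)\,d\eta_L = 1.
$$

The expression inside the square brackets in (9.49) does not change as much
as the oscillating exponent and we obtain a good estimate just taking its value at
$\eta_L = \eta_r$. Then, substituting (9.55) in (9.49) and performing an explicit integration
over $\eta_L$, one gets

$$
\frac{\delta T}{T} = \int\left[\Phi + \frac{\delta}{4} - \frac{3\delta'}{4k^2}\frac{\partial}{\partial\eta_0}\right]_{\eta_r} e^{-(\sigma k\eta_r)^2}\,e^{i\mathbf{k}\cdot(\mathbf{x}_0 + \mathbf{l}(\eta_r - \eta_0))}\,\frac{d^3k}{(2\pi)^{3/2}}, \tag{9.56}
$$

where

$$
\sigma = \frac{1}{\sqrt{6}\,(\kappa\mathcal{H}\eta)_r}. \tag{9.57}
$$

In deriving (9.56) we replaced $(\mathbf{k}\cdot\mathbf{l})^2$ with $k^2/3$, using the fact that the perturbation
field is isotropic. Now it is safe to neglect $\eta_r$ compared to $\eta_0$.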

To find out how $\sigma$ depends on cosmological parameters, we have to calculate
$(\mathcal{H}\eta)_r$. At recombination, the dark energy contribution can be ignored and the scale
factor is well described by (1.81); hence,

$$
(\mathcal{H}\eta)_r = 2\times\frac{1 + (\eta_r/\eta_*)}{2 + (\eta_r/\eta_*)}. \tag{9.58}
$$

The ratio $(\eta_r/\eta_*)$ can be expressed through the ratio of the redshifts at equality and
recombination using an obvious relation

$$
\left(\frac{\eta_r}{\eta_*}\right)^2 + 2\left(\frac{\eta_r}{\eta_*}\right) \simeq \frac{z_{eq}}{z_r}. \tag{9.59}
$$

Taking this into account and substituting (9.58) in (9.57) we obtain

$$
\sigma \simeq 1.49\times 10^{-2}\left[1 + \left(1 + \frac{z_{eq}}{z_r}\right)^{-1/2}\right]. \tag{9.60}
$$

The exact value of $z_{eq}$ depends on the matter contribution to the total energy density
and the number of ultra-relativistic species present in the early universe. For three
types of light neutrinos we have

$$
\frac{z_{eq}}{z_r} \simeq 12.8\,(\Omega_m h_{75}^2). \tag{9.61}
$$

The parameter $\sigma$ is only weakly sensitive to the amount of cold matter and number
of light neutrinos: for $\Omega_m h_{75}^2 \simeq 0.3$, $\sigma \simeq 2.2\times 10^{-2}$, and for $\Omega_m h_{75}^2 \simeq 1$,
$\sigma \simeq 1.9\times 10^{-2}$.
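
The two quoted values of $\sigma$ can be reproduced by chaining (9.57), (9.58), (9.59) and (9.61). The sketch below is one way to do this; the function names are illustrative, and $\kappa = 13.7$ is the rounded value quoted in the text.

```python
import math

KAPPA = 13.7  # kappa = 14400 / z_r with z_r ~ 1050, as quoted in the text

def zeq_over_zr(omega_m_h2):
    """Ratio z_eq / z_r from (9.61), for three light neutrino species."""
    return 12.8 * omega_m_h2

def sigma(omega_m_h2):
    """Finite thickness parameter sigma from (9.57), (9.58) and (9.59)."""
    q = zeq_over_zr(omega_m_h2)
    x = math.sqrt(1.0 + q) - 1.0           # eta_r / eta_*, positive root of (9.59)
    h_eta = 2.0 * (1.0 + x) / (2.0 + x)    # (H eta)_r from (9.58)
    return 1.0 / (math.sqrt(6.0) * KAPPA * h_eta)

sigma_low = sigma(0.3)    # close to 2.2e-2
sigma_high = sigma(1.0)   # close to 1.9e-2
```

Eliminating $\eta_r/\eta_*$ analytically gives the closed form (9.60), with prefactor $1/(2\sqrt{6}\,\kappa) \approx 1.49\times 10^{-2}$.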


Problem 9.6 Find how $\sigma$ depends on the number of light neutrinos for a given cold
matter density.


Next let us consider how noninstantaneous recombination influences the Silk
dissipation scale. As mentioned above, the ionization fraction at $\eta = \eta_r$ is $\kappa \simeq 13.7$
times bigger than at decoupling and the mean free path is consequently $\kappa$ times
smaller than the horizon scale. Therefore, one can try to use the result (7.128),
obtained in the imperfect fluid approximation, to estimate the extra dissipation
during noninstantaneous recombination.


Problem 9.7 Using (3.202) for the ionization fraction (which is valid when the
ionization fraction drops below unity), calculate the dissipation scale and show
that

$$
(k_D\eta)_r^{-2} \simeq 4.9\times 10^{-3}\,c_s^2\,\frac{\sqrt{\Omega_m h_{75}^2}}{\eta_{10}} + \frac{12}{5}\,c_s^2\,\sigma^2. \tag{9.62}
$$

The first term here is what was obtained for instantaneous recombination
(equation (7.132)), where we have expressed $\Omega_b h_{75}^2$ in terms of $\eta_{10} = 10^{10}\,n_b/n_\gamma$ (see
(3.121)). This term accounts for the dissipation before recombination starts. The
second term accounts for the additional dissipation during recombination.


Note that the second term in (9.62) corresponds to a scale which is smaller than
the mean free path $\tau_\gamma$ at $\eta_r$, and so the imperfect fluid approximation cannot be
trusted. However, within the time interval $\Delta\eta \sim \eta_r\sigma$, when the visibility function is
different from zero, free photons have only enough time to propagate the comoving
distance $\lambda \sim \eta_r\sigma$, which is roughly the second term in (9.62). The inhomogeneities
in the radiation can be smeared (mainly because of free streaming) only within these
scales but not on larger scales. Therefore, the result (9.62) can still be used as a
reasonable rough estimate for the scale below which the inhomogeneities will be
suppressed.

At very low baryon density the first term in (9.62) dominates, and most of the
dissipation happens before ionization significantly drops. However, for realistic
values of the dark matter and baryon densities, $\Omega_m h_{75}^2 \simeq 0.3$ and $\eta_{10} \simeq 5$, the
second term can be nearly twice the first term. Thus, the corrections to Silk dissipation
due to noninstantaneous recombination can be important.


Problem 9.8 If the baryon density is too low the approximations used above are
invalid. What is the minimal value of $\eta_{10}$ (or $\Omega_b h_{75}^2$) for which the derived results
can still be trusted? For smaller values, how would the results be modified?


In summary we have found that the finite duration of recombination produces 
two effects. First, the damping scale can be essentially greater than if recombination 
were instantaneous. Second, the uncertainty in the time of decoupling results in an 
extra suppression of temperature fluctuations on small angular scales. Although 
both effects are interconnected, they are distinct. 

The key formulae derived in the instantaneous recombination approximation are
easily modified for the case of delayed recombination. Namely, for the damping
scale of radiation inhomogeneities one has to use (9.62) and the finite thickness
leads to the appearance of the overall factor

$$
\exp\left[-2(\sigma k\eta_r)^2\right]
$$

in the integrand of (9.38) for the multipole moments $C_l$.


9.7 Anisotropies on small angular scales 


For large $l$ or small angular scales, the main contribution to $C_l$ comes from
perturbations with angular size $\theta \sim \pi/l$ on today's sky. The multipole moment $l \sim 200$
corresponds to the sound horizon scale at recombination. Hence, the perturbations
responsible for the fluctuations with $l > 200$ have wavenumbers $k > \eta_r^{-1}$; that is,
they entered the horizon before recombination. These perturbations undergo
evolution, causing a significant modification of the primordial spectrum.

In Section 7.4.2, we found the transfer function relating the initial spectrum of
gravitational potential fluctuations $\Phi_k^0$ to the spectra of $\Phi$ and $\delta_\gamma$ at recombination
in two limiting cases: namely, for the perturbations that entered the horizon well
before equality, (7.143), and perturbations that enter well after equality, (7.134).
However, for realistic values of the cosmological parameters, neither of these limits
applies directly to the most interesting band of multipole moments corresponding
to the first few acoustic peaks. The approximation (7.143) is valid only for modes
which undergo at least one oscillation before equality ($k\eta_{eq} > 2\pi\sqrt{3} \sim 10$), and
approximation (7.134) is legitimate for modes which enter the horizon after the
radiation density has become negligible compared to the matter density. If $\Omega_m h_{75}^2 \simeq 0.3$,
then it follows from (9.61) that $z_{eq}/z_r \sim 4$, and the radiation still constitutes
about 20% of the energy density at recombination. Hence, the asymptotics (7.134)
and (7.143) are poor approximations for perturbations which enter the horizon near
equality. These are precisely the modes responsible for the fluctuations in the region
of the first few acoustic peaks. Since the precise positions and shapes of these peaks
provide valuable information about cosmological models, it is worth improving the
approximation for the source function $(\Phi_k + \delta_k/4)_r$ in this region.


9.7.1 Transfer functions 


If the speed of sound changes slowly one can use after matter-radiation equality
the WKB solution (7.127), derived in Section 7.4, for the subhorizon modes. Since
the gravitational potential no longer changes significantly at this time, we find that
for a given amplitude of the gravitational potential $\Phi_k^0$ of the primordial spectrum
the source function at recombination is

$$
\left(\Phi_k + \frac{\delta_k}{4}\right)_r \simeq \left[T_p\left(1 - \frac{1}{3c_s^2}\right) + T_o\sqrt{c_s}\,\cos\left(k\int_0^{\eta_r}c_s\,d\eta\right)e^{-(k/k_D)^2}\right]\Phi_k^0 \tag{9.63}
$$

and

$$
(\delta'_k)_r \simeq -4\,T_o\,k\,c_s^{3/2}\,\sin\left(k\int_0^{\eta_r}c_s\,d\eta\right)e^{-(k/k_D)^2}\,\Phi_k^0, \tag{9.64}
$$

where the transfer functions $T_p$ and $T_o$ correspond to the constants of integration
in the WKB solution and depend on whether the perturbation entered the horizon
before or after equality. They were calculated in Section 7.4.2 in two limiting
cases, namely, for the perturbations with $k\eta_{eq} \ll 1$, which entered the horizon long
enough after equality, see (7.134),

$$
T_p \to \frac{9}{10}, \qquad T_o \to \frac{9}{10}\times 3^{-3/4} \simeq 0.4, \tag{9.65}
$$

and for the perturbations with $k\eta_{eq} \gg 1$, which entered the horizon well before
equality, see (7.143),

$$
T_p \to \frac{\ln(0.15\,k\eta_{eq})}{(0.27\,k\eta_{eq})^2} \to 0, \qquad T_o \to \frac{3^{5/4}}{2} \simeq 1.97. \tag{9.66}
$$

Note that the transfer functions change very significantly. In particular, $T_p$ is
negligible for the perturbations which entered the horizon during the
radiation-dominated stage and close to unity for the perturbations which entered the horizon
long after that. The physical reason for such behavior is obvious. As we found in
Section 6.4.3, for the subhorizon modes the gravitational instability in the cold dark
matter component is suppressed during the radiation-dominated stage and the
gravitational potential decays; for the perturbation which enters the horizon when cold
matter already dominates, the amplitude of inhomogeneity in the cold component
grows and the potential does not change. $T_o$, which defines the amplitude of the
sound wave, is about 5 times greater for modes that enter the horizon well before
equality than for those that enter the horizon long afterwards. This effect is due to the
gravitational field of the radiation; it is significant when the modes with large $k\eta_{eq}$
enter the horizon and boosts the amplitude of the resulting sound wave compared to
the case when the contribution of radiation to the gravitational potential is negligible.
It is clear that for those perturbations which entered the horizon near equality,
the appropriate values of the transfer functions should lie somewhere between their
asymptotic values. As we mentioned above these perturbations with $k\eta_{eq} \sim O(1)$
determine the amplitude of the temperature fluctuations in the region of the first
few acoustic peaks and therefore are most interesting. Unfortunately, the transfer
functions in the intermediate region between two asymptotics can be calculated
only numerically. In general, $T_p$ and $T_o$ should depend on $k$ and $\eta_{eq}$, which by
dimensional analysis must enter in the combination $k\eta_{eq}$, and the baryon density. To
simplify the analysis, we will restrict ourselves to the case where the baryon density
is small compared with the total density of dark matter, $\Omega_b \ll \Omega_m$, a practical limit
since this condition is satisfied by the real universe. With this assumption, we can
neglect the baryon contribution to the gravitational potential compared with the
contribution from cold dark matter, and the $\Omega_b$-dependence of the transfer functions
can be ignored. The result of the numerical calculation of $T_p$ and $T_o$ in the limit of
negligible baryon density is presented in Figure 9.1.

For intermediate range scales $10 > k\eta_{eq} > 1$, which give the leading
contributions to the first acoustic peaks of the microwave background anisotropy, one can
approximate $T_p$ by

$$
T_p \simeq 0.25\,\ln\left(\frac{14}{k\eta_{eq}}\right) \tag{9.67}
$$

and $T_o$ by

$$
T_o \simeq 0.36\,\ln(5.6\,k\eta_{eq}). \tag{9.68}
$$

The transfer functions are monotonic and approach their asymptotic values given
by (9.65) and (9.66) in the appropriate limits.
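
The fits (9.67) and (9.68) are simple enough to tabulate directly. The sketch below evaluates them over the intermediate range and compares with the limiting values of the transfer functions; the asymptotic constants are taken here as roughly 0.9 and 0.4 for late-entering modes and roughly 1.97 for the sound-wave amplitude of early-entering modes (the latter about five times the former, as stated above). The function and constant names are illustrative assumptions.

```python
import math

def t_p(k_eta_eq):
    """Fit (9.67) for T_p, intended for 1 < k*eta_eq < 10."""
    return 0.25 * math.log(14.0 / k_eta_eq)

def t_o(k_eta_eq):
    """Fit (9.68) for T_o, intended for 1 < k*eta_eq < 10."""
    return 0.36 * math.log(5.6 * k_eta_eq)

# Limiting values used for comparison (see lead-in for the assumptions).
T_P_LATE = 0.9                   # T_p for k*eta_eq << 1
T_O_LATE = 0.9 * 3.0 ** -0.75    # T_o for k*eta_eq << 1, about 0.4
T_O_EARLY = 3.0 ** 1.25 / 2.0    # T_o for k*eta_eq >> 1, about 1.97

samples = [1.0, 2.0, 5.0, 10.0]
table = [(x, t_p(x), t_o(x)) for x in samples]
```

Across the sampled range $T_p$ falls from about 0.66 to about 0.08 while $T_o$ rises from about 0.62 to about 1.45, so both stay between their limiting values.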

9.7.2 Multipole moments


To calculate the multipole moments $C_l$, we have to substitute (9.63) and (9.64) into
(9.38) with an extra $\exp(-2(\sigma k\eta_r)^2)$ factor inside the integrand to account for the
finite thickness effect. The resulting integral expressions are rather complicated,
but can be very much simplified for $l \gg 1$. We first remove the derivatives of the
spherical Bessel function in (9.38) using the identity

$$
\left(\frac{dj_l(y)}{dy}\right)^2 = \left[1 - \frac{l(l+1)}{y^2}\right]j_l^2(y) + \frac{1}{2y}\,\frac{d^2\left(y\,j_l^2(y)\right)}{dy^2}, \tag{9.69}
$$

which can be verified using the Bessel function equation. Substituting (9.69) into
(9.38) and integrating by parts, we obtain

$$
C_l = \frac{2}{\pi}\int\left[\left|\Phi + \frac{\delta}{4}\right|^2 k^2 + \frac{9\,|\delta'|^2}{16}\left(1 - \frac{l(l+1)}{(k\eta_0)^2}\right)\right](1 + \Delta)\,e^{-2(\sigma k\eta_r)^2}\,j_l^2(k\eta_0)\,dk, \tag{9.70}
$$

where $\Delta$ denotes corrections of order $\eta_r/\eta_0$ and $(k\eta_0)^{-1}$, which were estimated
taking into account (9.63) and (9.64) for the source functions. Recall that
$\eta_r/\eta_0 \lesssim 1/30$ is small. As $l \to \infty$, we can approximate the Bessel functions as

$$
j_l(y) \to \begin{cases} 0, & y < \nu, \\[4pt] \dfrac{1}{y^{1/2}\,(y^2 - \nu^2)^{1/4}}\cos\left[\sqrt{y^2 - \nu^2} - \nu\arccos\left(\dfrac{\nu}{y}\right) - \dfrac{\pi}{4}\right], & y > \nu, \end{cases} \tag{9.71}
$$

where $\nu/y \neq 1$ is held fixed and $\nu = l + 1/2$. Only those modes for which
$y = k\eta_0 > l$ contribute to the integral and therefore the corrections of order
$(k\eta_0)^{-1} < l^{-1} \ll 1$ can also be neglected. Using the approximation (9.71) in the integrand of
(9.70), and bearing in mind that the argument of $j_l(k\eta_0)$ changes with $k$ much more
rapidly than the argument in the oscillating part of the source functions (9.63) and
(9.64), we can replace the cosine squared coming from (9.71) by its average value
1/2. The result is

$$
C_l \simeq \frac{1}{16\pi}\int_{l\eta_0^{-1}}^{\infty}\left[\frac{|4\Phi + \delta|^2\,k^2}{(k\eta_0)\sqrt{(k\eta_0)^2 - l^2}} + \frac{9\sqrt{(k\eta_0)^2 - l^2}}{(k\eta_0)^3}\,|\delta'|^2\right]e^{-2(\sigma k\eta_r)^2}\,dk, \tag{9.72}
$$

where for large $l$ we have set $l + 1 \approx l$.
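
The Bessel identity (9.69) used above can be spot-checked numerically. The sketch below builds $j_l$ by upward recurrence from the closed forms of $j_0$ and $j_1$ (adequate only for small $l$ and $y > l$) and takes the derivatives by central differences. The function names and the step size are illustrative assumptions.

```python
import math

def sph_bessel(l, y):
    """Spherical Bessel function j_l(y) by upward recurrence (small l, y > l)."""
    j_prev = math.sin(y) / y
    j_curr = math.sin(y) / y**2 - math.cos(y) / y
    if l == 0:
        return j_prev
    for n in range(1, l):
        # j_{n+1} = (2n+1)/y * j_n - j_{n-1}
        j_prev, j_curr = j_curr, (2 * n + 1) / y * j_curr - j_prev
    return j_curr

def identity_sides(l, y, h=1e-3):
    """Return (lhs, rhs) of (9.69), with derivatives from central differences."""
    def f(t):
        return t * sph_bessel(l, t) ** 2
    d2 = (f(y + h) - 2.0 * f(y) + f(y - h)) / h**2
    dj = (sph_bessel(l, y + h) - sph_bessel(l, y - h)) / (2.0 * h)
    lhs = dj**2
    rhs = (1.0 - l * (l + 1) / y**2) * sph_bessel(l, y) ** 2 + d2 / (2.0 * y)
    return lhs, rhs

lhs_a, rhs_a = identity_sides(2, 5.0)
lhs_b, rhs_b = identity_sides(3, 7.3)
```

The two sides agree up to the finite-difference error, which is of order $h^2$.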

Let us consider the case of a scale-invariant spectrum of initial density
perturbations, $|(\Phi_k^0)^2 k^3| = B$, where $B$ is constant. Substituting (9.63) and (9.64) into
(9.72) and changing the integration variable to $x = k\eta_0/l$, we get a sum of integrals
with "oscillating" ($O$) and "nonoscillating" ($N$) functions in the integrands, so that

$$
l(l+1)\,C_l \simeq \frac{B}{\pi}\,(O + N). \tag{9.73}
$$

Because of the cross-term that arises when the expression in (9.63) is squared in
the integrand of (9.72), the oscillating contribution to $l(l+1)\,C_l$ can be written as
a sum of two integrals:

$$
O = O_1 + O_2, \tag{9.74}
$$

where

$$
O_1 = 2\sqrt{c_s}\left(1 - \frac{1}{3c_s^2}\right)\int_1^\infty\frac{T_p\,T_o\,e^{-\frac{1}{2}\left(l_f^{-2} + l_S^{-2}\right)l^2x^2}\,\cos(l\varrho x)}{x^2\sqrt{x^2 - 1}}\,dx \tag{9.75}
$$

and

$$
O_2 = \frac{c_s}{2}\int_1^\infty T_o^2\,\frac{(1 - 9c_s^2)\,x^2 + 9c_s^2}{x^4\sqrt{x^2 - 1}}\,e^{-(l/l_S)^2x^2}\,\cos(2l\varrho x)\,dx. \tag{9.76}
$$

Note that the periods of the cosines entering $O_1$ and $O_2$ differ by a factor of 2.
As we will soon see, the acoustic peaks and valleys in the spectrum for $l(l+1)\,C_l$
result from the constructive and destructive interference of these two terms. The
parameter


$$
\varrho = \frac{1}{\eta_0}\int_0^{\eta_r}c_s(\eta)\,d\eta \tag{9.77}
$$

determines the locations of the peaks.
The scales $l_f$ and $l_S$, which characterize the damping due to the finite thickness
and Silk dissipation effects, are equal to

$$
l_f^{-2} = 2\sigma^2\left(\frac{\eta_r}{\eta_0}\right)^2, \qquad l_S^{-2} = 2\left(\sigma^2 + (k_D\eta)_r^{-2}\right)\left(\frac{\eta_r}{\eta_0}\right)^2, \tag{9.78}
$$

where $\sigma$ is given in (9.60) and $(k_D\eta)_r$ is given roughly by (9.62).
Likewise, the nonoscillating contribution to $l(l+1)\,C_l$ is a sum of three integrals:

$$
N = N_1 + N_2 + N_3, \tag{9.79}
$$

where

$$
N_1 = \left(1 - \frac{1}{3c_s^2}\right)^2\int_1^\infty\frac{T_p^2\,e^{-(l/l_f)^2x^2}}{x^2\sqrt{x^2 - 1}}\,dx \tag{9.80}
$$

is proportional to the baryon density and vanishes in the absence of baryons where
$c_s^2 \to 1/3$. The other two integrals are

$$
N_2 = \frac{c_s}{2}\int_1^\infty\frac{T_o^2\,e^{-(l/l_S)^2x^2}}{x^2\sqrt{x^2 - 1}}\,dx \tag{9.81}
$$

and

$$
N_3 = \frac{9c_s^3}{2}\int_1^\infty T_o^2\,\frac{\sqrt{x^2 - 1}}{x^4}\,e^{-(l/l_S)^2x^2}\,dx. \tag{9.82}
$$

The microwave background anisotropy is a powerful cosmological probe because
the parameters which determine the spectrum $l(l+1)\,C_l$, namely, $c_s$, $l_f$, $l_S$, $\varrho$ and
the transfer functions $T_p$ and $T_o$, can all be directly related to the basic cosmological
parameters $\Omega_b$, $\Omega_m$, $\Omega_\Lambda$, the dark energy equation of state $w$, and the Hubble
constant $h_{75}$. Before we proceed with the calculation of the integrals determining
the anisotropies, we explore these relations, making the simplifying assumption
that the dark energy is the vacuum energy density, so that $w = -1$.


9.7.3 Parameters 


The speed of sound $c_s$ at recombination depends only on the baryon density, which
determines how much $c_s$ differs from its value for a purely relativistic gas of photons,
$c_s = 1/\sqrt{3}$. If we define the baryon density parameter

$$
\xi = \frac{1}{3c_s^2} - 1 = \frac{3}{4}\left(\frac{\varepsilon_b}{\varepsilon_\gamma}\right)_r \simeq 17\,(\Omega_b h_{75}^2), \tag{9.83}
$$

then the speed of sound is

$$
c_s^2 = \frac{1}{3(1 + \xi)}.
$$

For baryon density $\Omega_b h_{75}^2 \simeq 0.035$, one has $\xi \simeq 0.6$. The physical reason for the
dependence of $c_s$ on the baryon density is clear. The baryons interacting with
radiation make the sound waves more "heavy" and therefore reduce their speed.
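
To make the baryon dependence concrete, the sketch below converts a baryon density into $\xi$ and then into the sound speed. The linear scaling $\xi \approx 17\,\Omega_b h_{75}^2$ is an assumption of the example; it is chosen because it reproduces the quoted $\xi \simeq 0.6$ at $\Omega_b h_{75}^2 \simeq 0.035$. The function names are illustrative.

```python
import math

def xi_from_baryons(omega_b_h2):
    """Baryon loading xi, assuming xi ~ 17 * Omega_b h75^2 (see lead-in)."""
    return 17.0 * omega_b_h2

def sound_speed(xi):
    """Sound speed c_s = 1 / sqrt(3 (1 + xi)), in units of the speed of light."""
    return 1.0 / math.sqrt(3.0 * (1.0 + xi))

xi_fiducial = xi_from_baryons(0.035)     # about 0.6
cs_fiducial = sound_speed(xi_fiducial)   # about 0.46
cs_photons = sound_speed(0.0)            # 1/sqrt(3), no baryons
```

With $\xi \simeq 0.6$ the sound speed is about 20% below its pure-radiation value.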

The damping scales $l_f$ and $l_S$, given in (9.78), each depend on the ratio $\eta_r/\eta_0$. To
calculate this ratio we introduce a supplementary moment of time $\eta_0 > \eta_x > \eta_r$,
so that at $\eta_x$ the radiation energy density is already negligible and the cosmological
term is still small compared with the cold matter energy density. Then we can
separately determine $\eta_x/\eta_0$ and $\eta_r/\eta_x$ using the exact solutions (1.108) and (1.81)
respectively.
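
Problem 9.9 Show that $\eta_x/\eta_0 \simeq I_\Lambda\,z_x^{-1/2}$, where

$$
I_\Lambda = 3\left(\frac{\Omega_\Lambda}{\Omega_m}\right)^{1/6}\left[\int_0^y\frac{dx}{(\sinh x)^{2/3}}\right]^{-1} \tag{9.84}
$$

and $y = \sinh^{-1}(\Omega_\Lambda/\Omega_m)^{1/2}$. In a flat universe $\Omega_\Lambda = 1 - \Omega_m$, and the numerical
fitting formula

$$
I_\Lambda \simeq \Omega_m^{-0.09} \tag{9.85}
$$

approximates (9.84) to an accuracy better than 1% over the interval $0.1 < \Omega_m < 1$.
Verify that $\eta_r/\eta_x$ is equal to

$$
\frac{\eta_r}{\eta_x} = \left(\frac{z_x}{z_{eq}}\right)^{1/2}\left[\left(1 + \frac{z_{eq}}{z_r}\right)^{1/2} - 1\right]. \tag{9.86}
$$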
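
The quality of the fit (9.85) can be checked by direct quadrature. The sketch below assumes the integral form $I_\Lambda = 3\,(\Omega_\Lambda/\Omega_m)^{1/6}\big/\int_0^y dx\,(\sinh x)^{-2/3}$ with $y = \sinh^{-1}(\Omega_\Lambda/\Omega_m)^{1/2}$ for a flat universe, and removes the integrable singularity at $x = 0$ with the substitution $x = s^3$ before applying Simpson's rule. The function names and the number of panels are illustrative assumptions.

```python
import math

def i_lambda(omega_m, n=2000):
    """I_Lambda by quadrature for a flat universe (0 < omega_m < 1), n even.

    With x = s**3 the integrand dx / sinh(x)**(2/3) becomes
    3 * (s**3 / sinh(s**3))**(2/3) ds, which is smooth at s = 0.
    """
    ratio = (1.0 - omega_m) / omega_m
    y = math.asinh(math.sqrt(ratio))
    s_max = y ** (1.0 / 3.0)

    def g(s):
        u = s ** 3
        return 3.0 if u == 0.0 else 3.0 * (u / math.sinh(u)) ** (2.0 / 3.0)

    h = s_max / n
    total = g(0.0) + g(s_max)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * g(i * h)   # Simpson weights
    integral = total * h / 3.0
    return 3.0 * ratio ** (1.0 / 6.0) / integral

def i_lambda_fit(omega_m):
    """Fitting formula (9.85)."""
    return omega_m ** -0.09

comparison = {om: (i_lambda(om), i_lambda_fit(om)) for om in (0.3, 0.5, 0.7)}
```

At $\Omega_m = 0.3$ the quadrature gives $I_\Lambda \approx 1.10$, within about 1% of the fit. The limit $\Omega_m \to 1$ has to be taken analytically, where $I_\Lambda \to 1$.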


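Combining the relations from Problem 9.9 we obtain

$$
\frac{\eta_r}{\eta_0} = \frac{I_\Lambda}{\sqrt{z_r}}\left[\left(1 + \frac{z_r}{z_{eq}}\right)^{1/2} - \left(\frac{z_r}{z_{eq}}\right)^{1/2}\right]. \tag{9.87}
$$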

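Using this result together with (9.60) for $\sigma$, (9.78) becomes

$$
l_f \simeq 1530\left(1 + \frac{z_r}{z_{eq}}\right)^{1/2}I_\Lambda^{-1}, \tag{9.88}
$$

where we recall from (9.61) that

$$
\frac{z_r}{z_{eq}} \simeq 7.8\times 10^{-2}\,(\Omega_m h_{75}^2)^{-1} \tag{9.89}
$$

for three neutrino species.

The result is that the finite thickness damping coefficient $l_f$ depends only weakly
on the cosmological term and $\Omega_m h_{75}^2$. For $\Omega_m h_{75}^2 \simeq 0.3$ and $\Omega_\Lambda h_{75}^2 \simeq 0.7$, we have
$l_f \simeq 1580$, whereas for $\Omega_m h_{75}^2 \simeq 1$ and $\Omega_\Lambda h_{75}^2 \simeq 0$, we find $l_f \simeq 1600$.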
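
The weak parameter dependence of $l_f$ can be seen by evaluating (9.88) directly. The sketch below combines it with (9.89) and the fit (9.85) for $I_\Lambda$ in a flat universe; the function name and argument names are illustrative. Because the coefficients are rounded, the output agrees with the values quoted above only to within a few percent.

```python
import math

def l_f(omega_m_h2, omega_m):
    """Finite thickness damping scale from (9.88), (9.89) and the fit (9.85)."""
    zr_over_zeq = 7.8e-2 / omega_m_h2     # (9.89), three neutrino species
    i_lambda = omega_m ** -0.09           # fit (9.85), flat universe
    return 1530.0 * math.sqrt(1.0 + zr_over_zeq) / i_lambda

lf_concordance = l_f(0.3, 0.3)   # h75 = 1, Omega_Lambda = 0.7
lf_matter_only = l_f(1.0, 1.0)   # h75 = 1, Omega_Lambda = 0
```

Changing $\Omega_m$ by more than a factor of three moves $l_f$ by only a few percent, which is the point made in the text.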

The scale $l_S$ describing the combination of finite thickness and Silk damping
effects can be calculated in a similar way.


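Problem 9.10 Using the estimate (9.62) for the Silk dissipation scale, show that

$$
l_S \simeq 0.7\,l_f\left\{\frac{1 + 0.56\,\xi}{1 + \xi} + \frac{0.8}{\xi(1 + \xi)}\,\frac{(\Omega_m h_{75}^2)^{1/2}}{\left[1 + \left(1 + z_{eq}/z_r\right)^{-1/2}\right]^2}\right\}^{-1/2}. \tag{9.90}
$$
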
This estimate is not very reliable because the imperfect fluid approximation
breaks down when the visibility function is near maximum. Nevertheless,
comparison with exact numerical calculations shows that the discrepancy is less than 10%
with numerical $l_S$ being slightly smaller than in (9.90). In contrast to $l_f$, the scale
$l_S$ does depend on the baryon density, characterized by $\xi$. However, this
dependence is very strong only for $\xi \ll 1$, when the second term inside the parenthesis
in (9.90) dominates. For $\xi \simeq 0.6$, we find $l_S \simeq 1100$ if $\Omega_m h_{75}^2 \simeq 0.3$ and $l_S \simeq 980$
for $\Omega_m h_{75}^2 \simeq 1$.

The parameter $\varrho$, which determines the location of the acoustic peaks, can be
calculated by substituting the expression for $c_s(\eta)$

$$c_s(\eta) = \frac{1}{\sqrt{3}}\left[1+\xi\,\frac{a(\eta)}{a(\eta_r)}\right]^{-1/2} \tag{9.91}$$

with $a(\eta)$ given by (1.81) into (9.77) and then integrating.





Problem 9.11 Verify that

$$\varrho \simeq \frac{I_\Lambda}{\sqrt{3 z_r \xi}}\,\ln\left[\frac{\sqrt{(1+z_r/z_{eq})\,\xi}+\sqrt{1+\xi}}{1+\sqrt{\xi\, z_r/z_{eq}}}\right]. \tag{9.92}$$

Although it is clear that $\varrho$ depends on both baryon and matter densities, it is not
apparent from this expression how $\varrho$ varies when we change their values. For this
reason, it is useful to find a fitting formula for $\varrho$. Verify that the numerical fit

$$\varrho \simeq 0.014\,(1+0.13\,\xi)^{-1}\,(\Omega_m h_{75}^2)^{1/4}\, I_\Lambda \tag{9.93}$$

reproduces the exact result (9.92) to within 7% everywhere in the region $0 < \xi < 5$,
$0.1 < \Omega_m h_{75}^2 < 1$, whereas the function $\varrho$ itself varies by roughly a factor of 3.
Combining (9.93) with the numerical fit for $I_\Lambda$ in (9.85), we obtain

$$\varrho \simeq 0.014\,(1+0.13\,\xi)^{-1}\,(\Omega_m h_{75}^{3.1})^{0.16}. \tag{9.94}$$

Note the unusual combination $\Omega_m h_{75}^{3.1}$. We will see later that because of this we
can hope to determine $\Omega_m$ and $h_{75}$ separately by combining the measurements of
the location of the peaks with the measurements of other features of the microwave
spectrum which depend only on $\Omega_m h_{75}^2$.
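
The comparison between the closed form and the fit for $\varrho$ is easy to reproduce. The following sketch is a rough check under stated assumptions, not the author's calculation: it takes the closed form as $\varrho = I_\Lambda (3 z_r \xi)^{-1/2} \ln\{[\sqrt{(1+z_r/z_{eq})\xi}+\sqrt{1+\xi}]/[1+\sqrt{\xi z_r/z_{eq}}]\}$, uses $z_r/z_{eq}$ from (9.89), and assumes a recombination redshift $z_r = 1050$ (a value not fixed in this section). Because $I_\Lambda$ multiplies both the exact expression and the fit (9.93), it is set to 1 here. All function names are illustrative.

```python
import math

Z_R = 1050.0  # assumed recombination redshift (an assumption of this sketch)

def zr_over_zeq(omega_m_h2):
    """Ratio z_r/z_eq from (9.89), three neutrino species."""
    return 7.8e-2 / omega_m_h2

def rho_exact(xi, omega_m_h2, i_lambda=1.0):
    """Closed form for varrho, as in (9.92)."""
    eps = zr_over_zeq(omega_m_h2)
    num = math.sqrt((1.0 + eps) * xi) + math.sqrt(1.0 + xi)
    den = 1.0 + math.sqrt(xi * eps)
    return i_lambda / math.sqrt(3.0 * Z_R * xi) * math.log(num / den)

def rho_fit(xi, omega_m_h2, i_lambda=1.0):
    """Numerical fit (9.93)."""
    return 0.014 / (1.0 + 0.13 * xi) * omega_m_h2 ** 0.25 * i_lambda

worst = 0.0
for i in range(1, 101):        # xi from 0.05 to 5
    for j in range(10, 101):   # Omega_m h_75^2 from 0.1 to 1
        xi, w = 0.05 * i, 0.01 * j
        worst = max(worst, abs(rho_fit(xi, w) / rho_exact(xi, w) - 1.0))
print(f"largest relative deviation on the grid: {worst:.3f}")
```

The deviation depends on the assumed $z_r$, so the number printed should be read as indicative of the quoted accuracy rather than as a sharp bound.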


The parameter $\varrho$ characterizes the angular size of the sound horizon at recombination
on today’s sky. The size of the sound horizon drops as the baryon density
increases. For a given physical size of the sound horizon, its angular scale on
today’s sky should of course also depend on the evolution of the universe after
recombination. This is why the parameter $\varrho$ and the location of the acoustic peaks
in a flat universe are sensitive to the cosmological constant.

The transfer functions $T_p$ and $T_o$ depend only on

$$k\eta_{eq} = \frac{\eta_{eq}}{\eta_0}\, l x \simeq 0.72\,(\Omega_m h_{75}^2)^{-1/2}\, I_\Lambda\, l_{200}\, x, \tag{9.95}$$

where $l_{200} \equiv l/200$ and $x \equiv k\eta_0/l$.


Problem 9.12 Verify that

$$\frac{\eta_{eq}}{\eta_0} = \left(\sqrt{2}-1\right)\frac{I_\Lambda}{\sqrt{z_{eq}}} \simeq 3.57\times 10^{-3}\,(\Omega_m h_{75}^2)^{-1/2}\, I_\Lambda. \tag{9.96}$$

As we will see, for the most interesting range $1000 > l > 200$, the dominant
contribution to the integrals in (9.73) comes from $x$ close to unity, for which
$10 > k\eta_{eq} > 1$. Therefore, we can use for the transfer functions the approximations (9.67)
and (9.68), which can be rewritten in terms of $x$ and cosmological parameters as

$$T_p(x) \simeq 0.74 - 0.25\,(P + \ln x) \tag{9.97}$$

and

$$T_o(x) \simeq 0.5 + 0.36\,(P + \ln x), \tag{9.98}$$

where the function

$$P(l, \Omega_m, h_{75}) \equiv \ln\left(\frac{I_\Lambda\, l_{200}}{\sqrt{\Omega_m h_{75}^2}}\right) \tag{9.99}$$

tells us how the transfer function determining the fluctuations at multipole $l$
depends on the cosmological term and cold matter energy density. The physical
reason for the dependence of the transfer functions on the matter density is explained
in Section 9.7.1.


9.7.4 Calculating the spectrum 


We will now proceed to calculate the multipole spectrum $l(l+1)C_l$. The main
contribution to the integrals $O_1$ and $O_2$ in (9.75) and (9.76) arises in the vicinity of
the singular point $x = 1$.

Problem 9.13 Using the stationary (saddle) point method, verify that for a slowly
varying function $f(x)$

$$\int_1^\infty \frac{f(x)\cos(bx)}{\sqrt{x-1}}\,dx \simeq \frac{f(1)}{(1+B^2)^{1/4}}\sqrt{\frac{\pi}{b}}\,\cos\left(b+\frac{\pi}{4}+\frac{1}{2}\arcsin\frac{B}{\sqrt{1+B^2}}\right), \tag{9.100}$$

where

$$B \equiv \left(\frac{d\ln f}{b\,dx}\right)_{x=1}.$$

For large $b$ we can set $B \simeq 0$ and the above formula becomes

$$\int_1^\infty \frac{f(x)\cos(bx)}{\sqrt{x-1}}\,dx \simeq f(1)\sqrt{\frac{\pi}{b}}\,\cos\left(b+\frac{\pi}{4}\right). \tag{9.101}$$

(Hint: Make the substitution $x = y^2 + 1$ in (9.100).)
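
The asymptotic formula of Problem 9.13 can be tested on a case where it should hold essentially exactly. For $f(x) = e^{-\alpha(x-1)}$ one has $B = -\alpha/b$, and the substitution $x = y^2 + 1$ turns the integral into a Gaussian one, so the right-hand side of (9.100), read as $f(1)(1+B^2)^{-1/4}\sqrt{\pi/b}\,\cos\left(b+\pi/4+\tfrac{1}{2}\arcsin(B/\sqrt{1+B^2})\right)$, can be compared with direct quadrature. The sketch below is a sanity check only; the test function, the values $\alpha = 2$, $b = 50$, and the function names are assumptions chosen for illustration.

```python
import math

def lhs(alpha, b, y_max=6.0, n=12000):
    """Integral of exp(-alpha (x-1)) cos(b x) / sqrt(x-1) over x >= 1.

    After x = 1 + y**2 the integrand is 2 exp(-alpha y**2) cos(b (1 + y**2)),
    smooth at y = 0 and negligible beyond y_max. Composite Simpson rule, n even.
    """
    def g(y):
        return 2.0 * math.exp(-alpha * y * y) * math.cos(b * (1.0 + y * y))

    h = y_max / n
    s = g(0.0) + g(y_max)
    for i in range(1, n):
        s += (4 if i % 2 else 2) * g(i * h)
    return s * h / 3.0

def rhs(alpha, b):
    """Right-hand side of (9.100) with f(1) = 1 and B = -alpha / b."""
    big_b = -alpha / b
    root = math.sqrt(1.0 + big_b * big_b)
    phase = b + math.pi / 4.0 + 0.5 * math.asin(big_b / root)
    return math.sqrt(math.pi / b) / math.sqrt(root) * math.cos(phase)

numeric, formula = lhs(2.0, 50.0), rhs(2.0, 50.0)
print(f"quadrature {numeric:.6f}   formula {formula:.6f}")
```

For a general slowly varying $f$ the agreement is only asymptotic in $b$; the exponential is special because $\ln f$ is exactly linear.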


Using (9.101) to estimate the integrals in (9.75) and (9.76), we obtain

$$O \simeq \sqrt{\frac{\pi}{\varrho l}}\left[A_1\cos\left(l\varrho+\frac{\pi}{4}\right)+A_2\cos\left(2l\varrho+\frac{\pi}{4}\right)\right]e^{-(l/l_S)^2}, \tag{9.102}$$

with the coefficients

$$A_1 \simeq 0.1\,\xi\,\frac{(P-0.78)^2-4.3}{(1+\xi)^{1/4}}\,e^{\frac{1}{2}\left(l_S^{-2}-l_f^{-2}\right)l^2}, \qquad A_2 \simeq 0.14\,\frac{(0.5+0.36P)^2}{(1+\xi)^{1/2}}, \tag{9.103}$$

which are slowly varying functions of $l$. In deriving this expression we used (9.97)
and (9.98) for the transfer functions, which are valid in the range of multipoles
$200 < l < 1000$. For $l > 1000$ the fluctuations are strongly suppressed and this
effect is roughly taken into account by the exponential factor in (9.102). However,
the expected accuracy in this region is not as good as for $200 < l < 1000$.

We note that the contribution of the Doppler term to $O$ is equal to zero in this
approximation. A precise numerical evaluation reveals that the Doppler contribution
to the oscillating integrals is small, only a few percent or less of the total, for
multipoles $l > 200$.
Substituting (9.97) in (9.80) for the nonoscillating contribution $N_1$, we obtain

$$N_1 \simeq \xi^2\left[(0.74-0.25P)^2 I_0-(0.37-0.125P)\,I_1+(0.25)^2 I_2\right], \tag{9.104}$$

where the integrals

$$I_m(l/l_f) \equiv \int_1^\infty \frac{(\ln x)^m}{x^2\sqrt{x^2-1}}\,e^{-(l/l_f)^2x^2}\,dx \tag{9.105}$$

can be calculated in terms of hypergeometric functions. However, the resulting
expressions are not very transparent and therefore it makes sense to find a numerical
fit for them. The final result is

$$N_1 \simeq 0.063\,\xi^2\,\frac{\left[P-0.22\,(l/l_f)^{0.3}-2.6\right]^2}{1+0.65\,(l/l_f)^{1.4}}\,e^{-(l/l_f)^2}. \tag{9.106}$$

Similarly, we obtain

$$N_2 \simeq \frac{0.037}{(1+\xi)^{1/2}}\,\frac{\left[P-0.22\,(l/l_S)^{0.3}+1.7\right]^2}{1+0.65\,(l/l_S)^{1.4}}\,e^{-(l/l_S)^2}. \tag{9.107}$$

The Doppler contribution to the nonoscillating part of the spectrum is comparable
to $N_2$ and is equal to

$$N_3 \simeq \frac{0.033}{(1+\xi)^{3/2}}\,\frac{\left[P-0.5\,(l/l_S)^{0.55}+2.2\right]^2}{1+2\,(l/l_S)^2}\,e^{-(l/l_S)^2}. \tag{9.108}$$

The numerical fits for $N$ reproduce the exact result for multipoles $200 < l < 1000$
to within a few percent accuracy for a wide range of cosmological parameters.
The ratio of the value of $l(l+1)C_l$ for $l > 200$ to its value for low multipole
moments (the flat plateau) is

$$\frac{l(l+1)\,C_l}{\left(l(l+1)\,C_l\right)_{\text{low } l}} = \frac{100}{9}\left(O+N_1+N_2+N_3\right), \tag{9.109}$$

where $O$, $N_1$, $N_2$, $N_3$ are given by (9.102), (9.106), (9.107) and (9.108) respectively.
The result in the case of the concordance model ($\Omega_m = 0.3$, $\Omega_\Lambda = 0.7$, $\Omega_b = 0.04$,
$\Omega_{tot} = 1$ and $H = 70$ km s$^{-1}$ Mpc$^{-1}$) is presented in Figure 9.2, where we have
shown the total nonoscillating contribution and the total oscillatory contribution by
the dashed lines. Their sum is the solid line.
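
To see how the pieces (9.102) to (9.109) combine, they can be evaluated directly. The sketch below is an illustration under several assumptions and should not be taken as the code behind Figure 9.2: the coefficients are transcribed from the fitting formulas as read from this section, $\varrho$ is taken from the fit (9.93) with $I_\Lambda$ from (9.85), and the damping scales are set by hand to the values quoted in the text for the concordance model ($l_f \simeq 1580$, $l_S \simeq 1100$). The function name and default arguments are hypothetical.

```python
import math

def spectrum_ratio(l, omega_m=0.3, h75=70.0 / 75.0, xi=0.6,
                   l_f=1580.0, l_s=1100.0):
    """l(l+1)C_l relative to the low-l plateau, following (9.102)-(9.109)."""
    w = omega_m * h75 ** 2                                   # Omega_m h_75^2
    i_lam = omega_m ** -0.09                                 # fit (9.85), flat universe
    rho = 0.014 / (1.0 + 0.13 * xi) * w ** 0.25 * i_lam      # fit (9.93)
    p = math.log(i_lam * (l / 200.0) / math.sqrt(w))         # (9.99)
    rf, rs = l / l_f, l / l_s

    # Oscillating part, (9.102) with the coefficients (9.103).
    a1 = (0.1 * xi * ((p - 0.78) ** 2 - 4.3) / (1.0 + xi) ** 0.25
          * math.exp(0.5 * (rs ** 2 - rf ** 2)))
    a2 = 0.14 * (0.5 + 0.36 * p) ** 2 / math.sqrt(1.0 + xi)
    osc = (math.sqrt(math.pi / (rho * l))
           * (a1 * math.cos(l * rho + math.pi / 4.0)
              + a2 * math.cos(2.0 * l * rho + math.pi / 4.0))
           * math.exp(-rs ** 2))

    # Nonoscillating parts, (9.106)-(9.108).
    n1 = (0.063 * xi ** 2 * (p - 0.22 * rf ** 0.3 - 2.6) ** 2
          / (1.0 + 0.65 * rf ** 1.4) * math.exp(-rf ** 2))
    n2 = (0.037 / math.sqrt(1.0 + xi) * (p - 0.22 * rs ** 0.3 + 1.7) ** 2
          / (1.0 + 0.65 * rs ** 1.4) * math.exp(-rs ** 2))
    n3 = (0.033 / (1.0 + xi) ** 1.5 * (p - 0.5 * rs ** 0.55 + 2.2) ** 2
          / (1.0 + 2.0 * rs ** 2) * math.exp(-rs ** 2))

    return 100.0 / 9.0 * (osc + n1 + n2 + n3)                # (9.109)

# Locate the first acoustic peak by a direct scan over multipoles.
peak_l = max(range(200, 400), key=spectrum_ratio)
print(peak_l, round(spectrum_ratio(peak_l), 2))
```

Varying `xi` or `omega_m` in this function gives a quick feel for how the height of the first peak responds to the baryon and cold matter densities.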

Our results are in good agreement with numerical calculations for a rather wide
range of cosmological parameters around the concordance model. Although numerical
codes are more precise, the analytic expressions enable us to understand
how the main features in the anisotropy power spectrum arise and how they depend
on cosmological parameters.

Fig. 9.2.


9.8 Determining cosmic parameters 


Assuming a Gaussian, adiabatic spectrum of initial density perturbations, as predicted
by inflation, the principal cosmological parameters are: the amplitude $B$ and
slope $n_s$ of the primordial spectrum, the baryon density $\xi \equiv 17\,(\Omega_b h_{75}^2)$, the cold
matter density $\Omega_m$, the vacuum density $\Omega_\Lambda$, and the Hubble parameter $h_{75}$. We shall
consider how the spectrum changes as the parameters are varied around the best-fit
model, $\Omega_b = 0.04$, $\Omega_m = 0.3$, $\Omega_\Lambda = 0.7$ and $H = 70$ km s$^{-1}$ Mpc$^{-1}$. Our formulae
above are valid for a wide variation in each parameter, more than sufficient to cover
the likely values, although some of the expressions break down if used in the limits
of very high or very low densities.

Keeping in mind the comments above on the physical origin of parameter dependence
of the coefficients in the formulae describing the fluctuations, the reader
can easily figure out why the main features of the spectrum depend in one way
or another on these parameters. Therefore we omit physical explanations for these
dependences in what follows.


The plateau For a nearly scale-invariant spectrum, the anisotropy power spectrum
on large angular scales ($l < 30$) is a nearly flat plateau. The amplitude and slope of
the plateau can be used to determine the primordial spectral amplitude and spectral
index. The accuracy is limited mainly by the cosmic variance and by the fact that $C_l$
for small $l$ is a weighted integral over modes that also include wavelengths smaller
than the Hubble scale, the contribution of which depends on other parameters, such
as $\Omega_m$, $\Omega_\Lambda$, etc. This prevents a determination of the spectral slope to better than
10% accuracy. To improve the accuracy and fix other cosmological parameters, one
must go to smaller angular scales and get information about the acoustic peaks and
other features of the spectrum.


The location of the peaks and the spatial curvature of the universe The acoustic
peaks arise when the oscillating term $O$, given by (9.102), is superimposed
on the “hill” given by the nonoscillating contribution $N(l) = N_1 + N_2 + N_3$ (see
Figure 9.2). The peak locations and the heights depend on both contributions. The
oscillation peaks in $O$ alone come from the superposition of two cosine terms
in (9.102), whose periods differ by a factor of 2. If $|A_1| \ll A_2$, the peaks are
located at

$$l_n = \pi\varrho^{-1}\left(n-\frac{1}{8}\right), \tag{9.110}$$

where $n = 1, 2, 3, \ldots$ and $\varrho$ is given by (9.94). The first term on the right hand side in
(9.102) has a period twice as large as that of the second term and its amplitude $A_1$ is
negative. Therefore, it interferes constructively for the odd peaks ($n = 1, 3, \ldots$) and
destructively for the even peaks ($n = 2, 4, \ldots$). Moreover, because of the relative
phase shift of the two cosine terms, their maxima do not coincide and the constructive
maxima of their sum lie between the closest maxima of the two individual
terms; that is,

$$l_1 \simeq \left(\frac{6 \div 7}{8}\right)\pi\varrho^{-1}, \qquad l_3 \simeq \left(2+\frac{6 \div 7}{8}\right)\pi\varrho^{-1}. \tag{9.111}$$

Here the notation $6 \div 7$, for instance, denotes a number between 6 and 7. If $|A_1| \gg A_2$,
the peaks move closer to the lower bounds of the intervals in (9.111).

In the concordance model, where $\xi \simeq 0.6$ and $\Omega_m h_{75}^{3.1} \simeq 0.26$, we find from (9.94) and
(9.111) that $l_1 \simeq 225 \div 265$ and $l_3 \simeq 825 \div 865$. The situation is made
more complicated by the nonoscillating contribution $N$. As is clear from Figure 9.2,
the hill causes the first peak to move towards the right and the third peak to move
towards the left.

The even peaks correspond to the multipoles where the two terms in (9.102)
interfere destructively. The second peak should be located at

$$l_2 \simeq \left(1+\frac{6 \div 7}{8}\right)\pi\varrho^{-1} \simeq 525 \div 565 \tag{9.112}$$

in the current best-fit model. However, for some choices of parameters the destructive
interference can annihilate this peak altogether.
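
The quoted peak windows follow from simple arithmetic. As a rough illustration (the function names are hypothetical, and the formulas are used in the form $\varrho \simeq 0.014\,(1+0.13\xi)^{-1}(\Omega_m h_{75}^{3.1})^{0.16}$ and $l_n \simeq \left(n-1+\tfrac{6\div7}{8}\right)\pi/\varrho$), the sketch below evaluates the intervals for the first three peaks of the oscillating term with the concordance values $\xi = 0.6$ and $\Omega_m h_{75}^{3.1} = 0.26$ stated in the text.

```python
import math

def varrho(xi, omega_m_h31):
    """Fit (9.94); omega_m_h31 stands for Omega_m * h_75**3.1."""
    return 0.014 / (1.0 + 0.13 * xi) * omega_m_h31 ** 0.16

def peak_window(n, rho):
    """Interval for the n-th peak of O, as in (9.111) and (9.112).

    The upper edge, (n - 1/8) pi / rho, is the n-th maximum of the
    cos(2 l rho + pi/4) term, i.e. (9.110); the lower edge replaces 7/8 by 6/8.
    """
    lo = (n - 1 + 6.0 / 8.0) * math.pi / rho
    hi = (n - 1 + 7.0 / 8.0) * math.pi / rho
    return lo, hi

rho = varrho(0.6, 0.26)
for n in (1, 2, 3):
    lo, hi = peak_window(n, rho)
    print(f"l_{n}: {lo:.0f} to {hi:.0f}")
```

These windows describe the oscillating term alone; the shift caused by the nonoscillating hill is not included.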

The consideration above refers to a spatially flat universe with $\Omega_{tot} = 1$. Let
us now consider how the peak locations depend on the values of the fundamental
cosmological parameters. If the universe were curved, the angular size of the sound
horizon would change and the peaks would shift compared with the flat case. For
instance, as follows from (2.73), in a universe without the cosmological constant
$l_1 \propto \Omega_{tot}^{-1/2}$. Could we then accurately determine the spatial curvature simply by
measuring the location of the first peak? The answer to this question is not as
straightforward as it seems at first glance. According to (9.94), the value of $\varrho$
also depends on $\Omega_m$, $h_{75}$ and $\Omega_b$ (through $\xi$), and so it is clear from (9.111) and
(9.112) that the peak positions depend on these parameters. Over the range of
realistic values, while the sensitivity to these parameters is not as strong as to the
spatial curvature, it is nevertheless significant. As an example, if we take a flat
universe with the current best-fit values of the cosmological parameters, and then
double the baryon density ($\xi \simeq 0.6 \to \xi \simeq 1.2$), the first peak moves to the right
by $\Delta l_1 \simeq +20$, the second by $\Delta l_2 \simeq +40$, and the third by $\Delta l_3 \simeq +60$. We note
that the locations of the peaks depend on $\xi \propto \Omega_b h_{75}^2$, whereas the dependence
on the cold matter density enters through $\varrho$ as $\Omega_m h_{75}^{3.1}$. An increase in $\Omega_m$ has
the opposite effect on peak locations: if we were to double the cold dark matter
density ($\Omega_m h_{75}^{3.1} \simeq 0.3 \to \Omega_m h_{75}^{3.1} \simeq 0.6$), the first peak would move to the left
by $\Delta l_1 \simeq -20$ and the second and third peaks by $\Delta l_2 \simeq -40$ and $\Delta l_3 \simeq -60$
respectively. Thus, even keeping the spatial curvature fixed, the first peak can be
shifted significantly ($\Delta l_1 \simeq 40$) by doubling the baryon density and simultaneously
halving the cold matter density. This limits our ability to determine the spatial
curvature precisely based on the first peak location only. Fortunately, the parameter
degeneracy can be resolved by combining measurements of peak locations with
peak heights, as described below.


Acoustic peak heights, the baryon and cold matter densities, and flatness Substituting
$l_n$, given by (9.111) and (9.112), into (9.99) and using (9.92) for $\varrho$, we see that
the factor $I_\Lambda$ is canceled in the expression for $P$. Hence, the peak heights predicted
by (9.109) depend on the combinations $\Omega_m h_{75}^2$ and $\Omega_b h_{75}^2$ (or $\xi$). For fixed $\Omega_m h_{75}^2$,
an increase in the baryon density increases the height of the first acoustic peak
$H_1$. For instance, beginning from the current best-fit model, doubling the baryon
density increases $H_1$ by a factor of 1.5, due principally to $N_1$ (proportional to $\xi^2$)
and $O$ (since $A_1 \propto \xi$). An increase of the cold matter density for fixed $\xi$ suppresses
$H_1$, since $P$ decreases as $\Omega_m h_{75}^2$ increases. For the cold matter density, the sensitivity
comes from the $N_2$ and $N_3$ terms. Therefore, playing the various terms off one
another, the height of the first peak can be held fixed for certain combinations of
changes in the baryon and cold dark matter densities. If the baryon density is made
too large, though, an increase in $\Omega_m h_{75}^2$ cannot compensate an increase in $\Omega_b h_{75}^2$
because its effect on $H_1$ saturates for large values (and moreover $\Omega_m h_{75}^2$ cannot be
much greater than unity).

Current observations suggest that $H_1$ is 6-8 times the amplitude at large angular
scales. Based on the peak height alone, we can be sure that the baryon density is less
than 20% of the critical density. Although much better constraints can be obtained
using the full power spectrum and additional data, it is important to appreciate that
the peak height alone suffices to rule out a flat, baryon-dominated universe, which
was, in essence, the original concept of the big bang universe.

If the height of the first peak is kept fixed, the only freedom left is a simultaneous
change of the cold matter and baryon densities, both of which can be either increased
or decreased. For instance, we can still keep the height of the first peak unchanged
by simultaneously increasing the baryon and matter densities by a factor of about 1.5
from the concordance model densities. However, since the increases of the baryon
and cold matter densities have opposing effects on the location of the peak, its net
shift will be negligible. This explains why, for a fixed height, the location of the
first peak depends sensitively on the spatial curvature only, allowing us to resolve
the degeneracy in its determination. The current data on the location of the first
peak strongly support the flat universe predicted by inflation.

To break the degeneracy between baryon and matter density altogether, it suffices
to consider the second acoustic peak, which results primarily from the destructive
interference between oscillatory terms in (9.102), but bearing in mind that they are
superimposed on the “hill” due to $N$. The first term in (9.102) makes a negative
contribution to the second peak and has a coefficient that is proportional to $\xi$.
The second term makes a positive contribution to the second peak but slightly
decreases as $\xi$ increases. Hence, one can see that the second peak shrinks as the
baryon density increases. The two terms nearly cancel altogether when the baryon
density is about 8% of the critical density, or about twice the best-fit value. Curiously
enough, though, exact numerical calculations show that a tiny second peak survives
for $\Omega_m h_{75}^2 \simeq 0.26$ even when the baryon density is made much greater than 8%.
This is because an increase in the baryon density also increases $N_1$, making the
hill on which the oscillatory contributions rest much steeper in the vicinity of the
second peak. In other words, the appearance of the second peak depends on a
delicate cancellation and combination of diverse terms. So, for example, it would
be incorrect to conclude that the baryon density is less than 8% of the critical density
simply because one observes a second acoustic peak.

However, combining information about the height of the first peak with the fact
that the second peak exists (ignoring peak locations for the moment) does lead
to good limits on both baryon density and cold dark matter density. We initially
showed that to keep the first peak height $H_1$ fixed, the baryon density can be
increased only simultaneously with the dark matter density. That is, baryonic and
dark matter work in opposite directions in terms of their effect on the first peak.
Now we have seen that the second peak decreases as the baryon density increases.
For the second peak, it turns out that an increase in cold dark matter density has a
similar effect. Hence, the baryon density and the dark matter density work in the
same direction in terms of how they alter the height of the second peak. This is
because the positive contribution $O_2 \propto T_o^2$ decreases more rapidly than the negative
contribution $O_1 \propto T_p T_o$ as $\Omega_m h_{75}^2$ increases. Using both peak heights enables us
to determine both the baryon density and the cold dark matter density and thus
to resolve the degeneracy in the determination of $\Omega_m h_{75}^2$ and $\Omega_b h_{75}^2$. For instance,
assuming that the universe is flat and that cold matter constitutes 100% of the critical
density, we can fit the data for the height of the first peak only if $\Omega_b \simeq 0.08$ for
$H = 70$ km s$^{-1}$ Mpc$^{-1}$. However, in this case the second acoustic peak is absent.
It reappears only when we simultaneously decrease the cold dark matter and the
baryon density. Therefore, based on the data, the combination of the first acoustic
peak height with the fact that the second peak exists informs us that the cold dark
matter density cannot exceed half of the critical density and that the baryon density
is less than 8% of the critical density. Although these limits can be improved greatly
by an analysis of the full anisotropy power spectrum and other observations, it is
important to appreciate that the height of the first peak together with the existence
of the second peak are in themselves convincing evidence of the following key
qualitative features of our universe: that the total cold matter density is less than
the critical density, that cold dark matter exists and that its density exceeds the
baryon density.


Combining peak height and location If information about the first two peak heights
determines the baryon and matter density (of course, in combination with $h_{75}$), then
adding information about the first peak location fixes the spatial curvature precisely.
Data strongly suggest that the universe is flat and that the total energy density is
equal to the critical density. At the same time, the peak heights suggest that the dark
matter and baryon densities are significantly less than the critical density. Hence,
some form of dark energy must make up the difference and dominate the density
of the universe today.

Hence, combining peak heights and their location we can conclude that dark
energy exists. Note that this line of argument is totally independent of the supernova
luminosity-redshift test (see Section 2.5.2), which leads to the same
conclusion.

Since the heights of the peaks depend on $\Omega_m h_{75}^2$ and their locations on $\Omega_m h_{75}^{3.1}$,
we can squeeze out even more information; namely, we can determine the Hubble
constant. The dependence of the peak location on $h_{75}$ is modest, so a very accurate
measurement of the microwave background is required to obtain a reasonable
constraint on $h_{75}$. For example, if the locations and heights of the peaks are determined
to 1% accuracy, then the expected accuracy for the Hubble parameter will be
about 7%.


Reconsideration of the spectral tilt Until now we have been assuming that the primordial
spectrum of inhomogeneities is scale-invariant with spectral index $n_s = 1$.
Inflation predicts that there should be a small deviation from perfect scale invariance,
typically $n_s \simeq 0.92$-$0.97$. The above derivation for the microwave background
fluctuations can easily be modified to account for these deviations.

Problem 9.14 Show that the multipoles $C_l$ for a primordial spectrum with tilt
$n_s$ are modified by a factor proportional to $l^{n_s-1}$ compared with a scale-invariant
spectrum.

When we include uncertainty in the spectral tilt, the heights and locations
of the first two peaks are insufficient to determine $\Omega_m$, $\Omega_b$ and $h_{75}$ separately. Here
is where the third acoustic peak comes into play. The height of the third peak is
not as sensitive to $\Omega_b h_{75}^2$ and $\Omega_m h_{75}^2$ as the first two peaks, but it is sensitive to the
spectral index. Fixing these parameters and the height of the first peak, the ratio of
the third peak height to the first, $r \equiv H_3/H_1$, changes by the relative amount

$$\frac{\Delta r}{r} \simeq \left(\frac{l_3}{l_1}\right)^{n_s-1}-1 \simeq (n_s-1)\ln\left(\frac{l_3}{l_1}\right). \tag{9.113}$$

For instance, if $n_s \simeq 0.95$, the height of the third peak decreases by about 7%
compared with the case for $n_s = 1$.
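
A quick numerical reading of this estimate, assuming the tilt rescales $C_l$ by $l^{n_s-1}$ as in Problem 9.14: the sketch below compares the exact factor $(l_3/l_1)^{n_s-1} - 1$ with its linearization $(n_s-1)\ln(l_3/l_1)$. The peak positions $l_1 = 245$ and $l_3 = 845$ are assumed here as midpoints of the ranges quoted earlier for the concordance model; the function name is illustrative.

```python
import math

def tilt_change(n_s, l1, l3):
    """Relative change of r = H_3/H_1 when C_l is rescaled by l**(n_s - 1).

    Returns the exact change and its first-order approximation in (n_s - 1).
    """
    exact = (l3 / l1) ** (n_s - 1.0) - 1.0
    linear = (n_s - 1.0) * math.log(l3 / l1)
    return exact, linear

# l1 and l3 are assumed midpoints of the quoted peak ranges.
exact, linear = tilt_change(0.95, 245.0, 845.0)
print(f"exact {exact:+.3f}, linearized {linear:+.3f}")
```

Both numbers come out at the level of six to seven percent, in line with the estimate quoted in the text.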


Summary Thus we have seen what a powerful tool the microwave background 
power spectrum can be. The general shape — a plateau at large angular scales and 
acoustic peaks at small angular scales — confirms that the spectrum is predomi- 
nantly nearly scale-invariant and adiabatic. (The higher-order correlation functions 
should be used to show that the spectrum is also Gaussian.) This supports the basic 
predictions of the inflationary/big bang paradigm. Then we can proceed to use the 
quantitative details of the spectrum — the plateau and the heights and locations of 
the acoustic peaks — to determine the primary cosmological parameters. 

Our analysis is valid for a limited parameter set; the inclusion of other physical
effects or variants of the best-fit model weakens to some extent the conclusions
that can be drawn from measurements of the anisotropy. For example, secondary
reionization by early star formation at redshifts $z > 20$ or so reduces the small
angular scale power in a way that is difficult to disentangle from tilting the spectral
index. The dark energy may comprise quintessence, rather than vacuum energy.
In this case, we must introduce a new parameter, the equation of state of the dark
energy $w$ (or perhaps a function $w(z)$). Correlated changes in $\Omega_m$, $h_{75}$ and $w$ can
produce canceling effects that leave the plateau and the first three peaks virtually
unchanged. Thus, the temperature autocorrelation function is a powerful tool, but
it is not all-powerful.

To explore the range of possible models fully we need to use all the information
the power spectrum offers, in combination with other cosmological observations.
For example, the heights, locations and shapes of the peaks also depend on the dissipation
scales $l_f$ and $l_S$, which in turn depend on combinations of the cosmological
parameters. For the current best-fit model, for which $l_S \sim 1000$, dissipation does
not influence the first peak significantly, but it becomes increasingly important for
the higher-order peaks. Hence, using more peaks in the analysis further constrains
models.


9.9 Gravitational waves 


An important physical effect that we have neglected thus far is that of gravitational
waves, one of the basic predictions of inflationary cosmology. As discussed in
Section 7.1, to describe gravitational waves we use the metric

$$ds^2 = a^2\left(d\eta^2-(\delta_{ik}+h_{ik})\,dx^i dx^k\right). \tag{9.114}$$

The gravitational waves correspond to the traceless, divergence-free part of $h_{ik}$.
They produce perturbations in the microwave background by inducing redshifts
and blueshifts of the photons. Using the equation $p^\alpha p_\alpha = 0$, we can express the
zero component of the photon’s 4-momentum as

$$p^0 = \frac{p}{a^2}\left(1-\frac{1}{2}h_{ik}l^il^k\right), \tag{9.115}$$

where as before $p \equiv \left(\sum p_i^2\right)^{1/2}$, $l^i \equiv -p_i/p$, and we have kept only the first order
terms in metric perturbations. The photon geodesic equations for the metric (9.114)
take the following forms:

$$\frac{dx^i}{d\eta} = l^i+O(h), \qquad \frac{dp_j}{d\eta} = -\frac{1}{2}\,p\,\frac{\partial h_{ik}}{\partial x^j}\,l^il^k. \tag{9.116}$$

Taking into account that the distribution function $f$ depends only on the single
variable

$$y \equiv \frac{\omega}{T} = \frac{p_0}{T\sqrt{g_{00}}} \simeq \frac{p}{Ta}\left(1-\frac{1}{2}h_{ik}l^il^k\right), \tag{9.117}$$

and substituting (9.116) in the Boltzmann equation (9.7), we find that the temperature
fluctuations satisfy the equation

$$\left(\frac{\partial}{\partial\eta}+l^j\frac{\partial}{\partial x^j}\right)\frac{\delta T}{T} = -\frac{1}{2}\frac{\partial h_{ik}}{\partial\eta}\,l^il^k, \tag{9.118}$$

which has the obvious solution

$$\frac{\delta T}{T}(\mathbf{l}) = -\frac{1}{2}\int_{\eta_r}^{\eta_0}\frac{\partial h_{ik}}{\partial\eta}\,l^il^k\,d\eta. \tag{9.119}$$
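Note that until now we have not used the fact that $h_{ik}$ is a traceless, divergence-free
tensor. Therefore, (9.119) is a general result, which can also be applied when
calculating the temperature fluctuations induced by the scalar metric perturbations
in the synchronous gauge. For tensor perturbations, $h_{ik}$ satisfies the extra conditions
$h^i_i = h^i_{k,i} = 0$ (here we raise and lower the indices with the unit tensor $\delta_{ik}$), which
reduce the number of independent components of $h_{ik}$ to two, corresponding to
two independent polarizations of the gravitational waves. For random Gaussian
fluctuations, the tensor metric perturbations can be written as

$$h_{ik}(\mathbf{x},\eta) = \int h_k(\eta)\,e_{ik}(\mathbf{k})\,e^{i\mathbf{k}\mathbf{x}}\,\frac{d^3k}{(2\pi)^{3/2}}, \tag{9.120}$$

where $e_{ik}(\mathbf{k})$ is the time-independent random polarization tensor. Because of the
conditions $e^i_i = e^i_k k_i = 0$, $e_{ik}(\mathbf{k})$ should satisfy

$$\left\langle e_{ik}(\mathbf{k})\,e_{jl}(\mathbf{k}')\right\rangle = \left(P_{ij}P_{kl}+P_{il}P_{kj}-P_{ik}P_{jl}\right)\delta(\mathbf{k}+\mathbf{k}'), \tag{9.121}$$

where

$$P_{ij} \equiv \left(\delta_{ij}-k_ik_j/k^2\right) \tag{9.122}$$

is the projection operator. Substituting (9.120) into (9.119) and calculating the
tensor contribution to the correlation function of the temperature fluctuations (see
the definition in (9.33)), we obtain

$$C^T(\theta) = \frac{1}{4}\int F(\mathbf{l}_1,\mathbf{l}_2,\mathbf{k})\,\frac{\partial h_k(\eta)}{\partial\eta}\,\frac{\partial h_k^*(\tilde\eta)}{\partial\tilde\eta}\,e^{i\mathbf{k}\cdot\left[\mathbf{l}_1(\eta-\eta_0)-\mathbf{l}_2(\tilde\eta-\eta_0)\right]}\,d\eta\,d\tilde\eta\,\frac{d^3k}{(2\pi)^3}, \tag{9.123}$$

where $\cos\theta = \mathbf{l}_1\cdot\mathbf{l}_2$. In deriving (9.123) we have averaged over the random
polarization with the help of (9.121). The function $F$ entering this expression does not
depend on time and is equal to

$$F = 2\left(\cos\theta-\frac{(\mathbf{l}_1\mathbf{k})(\mathbf{l}_2\mathbf{k})}{k^2}\right)^2-\left(1-\frac{(\mathbf{l}_1\mathbf{k})^2}{k^2}\right)\left(1-\frac{(\mathbf{l}_2\mathbf{k})^2}{k^2}\right). \tag{9.124}$$

Introducing the new variable $x \equiv k(\eta_0-\eta)$ instead of $\eta$, and noting that

$$\frac{(\mathbf{l}_1\mathbf{k})}{k}\,e^{-i(\mathbf{l}_1\mathbf{k})x/k} = i\frac{\partial}{\partial x}\,e^{-i(\mathbf{l}_1\mathbf{k})x/k}, \qquad \frac{(\mathbf{l}_2\mathbf{k})}{k}\,e^{i(\mathbf{l}_2\mathbf{k})\tilde x/k} = -i\frac{\partial}{\partial\tilde x}\,e^{i(\mathbf{l}_2\mathbf{k})\tilde x/k},$$

after integration over the angular part of $\mathbf{k}$ we can rewrite (9.123) as

$$C^T(\theta) = \frac{1}{4}\int\frac{\partial h_k}{\partial x}\,\frac{\partial h_k^*}{\partial\tilde x}\,\hat F\left[\frac{\sin\left(|\mathbf{l}_2\tilde x-\mathbf{l}_1x|\right)}{|\mathbf{l}_2\tilde x-\mathbf{l}_1x|}\right]dx\,d\tilde x\,\frac{k^2dk}{2\pi^2}, \tag{9.125}$$

where

$$\hat F = 2\left(\cos\theta-\frac{\partial^2}{\partial x\,\partial\tilde x}\right)^2-\left(1+\frac{\partial^2}{\partial x^2}\right)\left(1+\frac{\partial^2}{\partial\tilde x^2}\right). \tag{9.126}$$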

Now we can use formula (9.36) to expand $C^T(\theta)$ as a discrete sum over multipole
momenta (see (9.37)). After a lengthy but straightforward calculation, the result for
$C_l^T$ can be written in a rather simple form.

Problem 9.15 Substitute (9.36) into (9.125) and use the recurrence relations for
$zP_l(z)$, the Bessel equation and the recurrence relations for the spherical
Bessel functions to express $j_l''$, $j_{l\pm2}$, $j_{l\pm1}$, etc. through $j_l$, $j_l'$, to verify that

$$C_l^T = \frac{(l-1)\,l\,(l+1)(l+2)}{2\pi}\int_0^\infty\left|\int_0^{k\eta_0-k\eta_r}\frac{\partial h_k}{\partial x}\,\frac{j_l(x)}{x^2}\,dx\right|^2k^2\,dk. \tag{9.127}$$

The derivative of the metric $\partial h_k/\partial x$ takes its maximal value at $k\eta \sim O(1)$
and drops very fast after that. Hence, for those gravitational waves which entered
the horizon, the main contribution to the integral over $x$ in (9.127) comes from the
relatively narrow region $k\eta_0 > x > k\eta_0 - O(1)$. For $l \gg 1$ and $k\eta_0 \gg 1$, the function
$j_l(x)/x^2$ does not change significantly within this interval and can therefore
be approximated by its value at $x_0 = k\eta_0$. As a result, (9.127) simplifies for $l \gg 1$
to

$$C_l^T \simeq \frac{(l-1)\,l\,(l+1)(l+2)}{2\pi}\int_0^\infty\left|h_k^2(\eta_r)\,k^3\right|\frac{j_l^2(x_0)}{x_0^5}\,dx_0, \tag{9.128}$$

where $\left|h_k^2(\eta_r)\,k^3\right|$ should be expressed as a function of $x_0 = k\eta_0$. The gravity waves
generated during inflation, which enter the horizon after recombination but well
before the present time, have a nearly flat spectrum at $\eta = \eta_r$, so that

$$\left|h_k^2(\eta_r)\,k^3\right| = B_{gw} \simeq \text{const} \tag{9.129}$$

for $\eta_r^{-1} > k > \eta_0^{-1}$. Taking into account that for $l \gg 1$ the main contribution to
(9.128) comes from the perturbations with $k \sim l/\eta_0$, and substituting (9.129) into
(9.128), we can calculate the integral with the help of (9.43). The result, valid for
$\eta_0/\eta_r \gg l \gg 1$, reads

$$l(l+1)\,C_l^T \simeq \frac{2}{15\pi}\,\frac{l(l+1)}{(l+3)(l-2)}\,B_{gw} \simeq 4.2\times10^{-2}\,B_{gw}. \tag{9.130}$$

(For example, $\eta_0/\eta_r \simeq 55$ for $\Omega_m h_{75}^2 \simeq 0.3$.) This estimate fails when applied to
the low multipoles and in particular to the quadrupole ($l = 2$). The quadrupole can
be calculated numerically and the result,

$$\left.l(l+1)\,C_l^T\right|_{l=2} \simeq 4.4\times10^{-2}\,B_{gw}, \tag{9.131}$$

does not differ greatly from the expression on the right hand side of (9.130).
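
The coefficient $2/(15\pi)$ can be recovered from the integral $\int_0^\infty j_l^2(x)\,x^{-5}\,dx$. The sketch below is a check under an assumption: it uses the standard closed form for $\int_0^\infty J_\nu^2(t)\,t^{-\lambda}\,dt$ in terms of gamma functions (assumed here to be equivalent to what formula (9.43) supplies), with $j_l^2 = (\pi/2x)\,J_{l+1/2}^2$, and compares the resulting plateau with $\frac{2}{15\pi}\,\frac{l(l+1)}{(l+3)(l-2)}$. Function names are illustrative, and the closed form requires $l > 2$.

```python
import math

def bessel_moment(l):
    """Integral of j_l(x)**2 / x**5 over x from 0 to infinity, for l > 2.

    Uses int_0^inf J_nu(t)^2 t^(-lam) dt =
        Gamma(lam) Gamma(nu + (1-lam)/2) / (2^lam Gamma((1+lam)/2)^2 Gamma(nu + (1+lam)/2))
    with nu = l + 1/2 and lam = 6, written with log-gamma to avoid overflow.
    """
    log_val = (math.log(math.pi / 2.0) + math.lgamma(6.0) + math.lgamma(l - 2.0)
               - 6.0 * math.log(2.0) - 2.0 * math.lgamma(3.5) - math.lgamma(l + 4.0))
    return math.exp(log_val)

def tensor_plateau(l):
    """l(l+1) C_l^T / B_gw from the simplified integral with a flat spectrum."""
    prefactor = (l - 1) * l * (l + 1) * (l + 2) / (2.0 * math.pi)
    return l * (l + 1) * prefactor * bessel_moment(l)

def closed_form(l):
    """(2 / 15 pi) l(l+1) / ((l+3)(l-2)), the plateau coefficient."""
    return 2.0 / (15.0 * math.pi) * l * (l + 1) / ((l + 3) * (l - 2))

for l in (3, 10, 100):
    print(l, tensor_plateau(l), closed_form(l))
```

For $l \gg 1$ the ratio $l(l+1)/((l+3)(l-2))$ tends to unity, which is why the plateau is flat in this range.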


Problem 9.16 Verify that the relative contribution of the tensor and scalar perturbations
generated during inflation to the quadrupole is

$$\frac{C_{l=2}^T}{C_{l=2}^S} \simeq 10.4\,c_s\left(1+\frac{p}{\varepsilon}\right), \tag{9.132}$$

where the expression on the right hand side must be estimated at the moment
when the perturbations responsible for the quadrupole cross the Hubble scale during
inflation. Taking $1 + p/\varepsilon \sim 10^{-2}$ and $c_s = 1$ we find that the gravitational waves
should contribute about 10% to the quadrupole component. In k inflation, where
$c_s \ll 1$, their contribution can be negligible.

As we found in Section 7.3.2, the amplitude of the gravitational waves which
entered the horizon is inversely proportional to the scale factor and, for $k > \eta_r^{-1}$,
their spectrum at $\eta = \eta_r$ is already significantly modified. For instance, for $k \gg \eta_{eq}^{-1}$,

$$\left|h_k^2(\eta_r)\,k^3\right| \sim O(1)\,B_{gw}\left(\frac{1}{k\eta_{eq}}\right)^2\left(\frac{z_r}{z_{eq}}\right)^2. \tag{9.133}$$

Substituting this expression into (9.128), we obtain

$$l(l+1)\,C_l^T \propto B_{gw}\left(\frac{l_{eq}}{l}\right)^2$$

for $l \gg l_{eq}$, where $l_{eq} \equiv \eta_0/\eta_{eq}$ ($l_{eq} \simeq 150$ for $\Omega_m h_{75}^2 \simeq 0.3$). In the intermediate
region $55 < l < 150$, the amplitude $l(l+1)C_l^T$ also decays. Note that all results
in this section were derived in the approximation of instantaneous recombination,
which is valid in the most interesting range of the multipoles.

Problem 9.17 Assuming that $\eta_{eq} \ll \eta_r$, determine how $l(l+1)C_l^T$ depends on $l$
for $\eta_0/\eta_{eq} > l > \eta_0/\eta_r$.


Fig. 9.3. [Plot of the total spectrum (S+T) and its tensor component (T) versus multipole $l$, on logarithmic axes.]


As with scalar perturbations, the contribution of the tensor perturbations to the
CMB power spectrum also consists of a flat plateau at low multipoles, due to
the superhorizon gravitational waves at last scattering. However, for $l > 55$, the
amplitude $l(l+1)C_l^T$ decreases quickly. Figure 9.3 was drawn using a precise
numerical code showing how the total spectrum is subdivided into scalar and tensor
components for the concordance model. Note that the tensor component dies off
rapidly where the acoustic peaks appear. Hence, detecting the tensor contribution
to the temperature autocorrelation function relies on comparing the height of the
plateau to the height of the acoustic peaks. It is difficult to separate this effect from
reionization or a spectral tilt. Polarization proves to be the better test for detecting
primordial gravitational waves.


9.10 Polarization of the cosmic microwave background 


Thus far, we have focused on the temperature fluctuations in the cosmic microwave
background, because the temperature autocorrelation provides the single, most
powerful test for distinguishing cosmological models and determining cosmological
parameters. However, there is more information to be gained by measuring
the polarization and its correlation with the temperature fluctuations. In particular,
polarization provides the cleanest and most sensitive method of detecting the
primordial spectrum of gravitational waves produced by inflation, one of the most
challenging predictions to verify.

The polarization of the cosmic microwave background is an inevitable consequence
in any model because recombination is not an instantaneous process. The
quadrupole anisotropy of the microwave background, which is absent before
recombination begins, is produced by both scalar and tensor perturbations as
recombination proceeds. In turn, radiation that undergoes Thomson scattering off
the free electrons becomes linearly polarized. Note that if recombination were
instantaneous, no significant polarization would be generated. Hence, measuring the
polarization gives us a chance to uncover the subtle details of the recombination
history. Note also that Thomson scattering on the electrons does not produce any
circular polarization.

Just as with the temperature fluctuations, the most useful quantity to compute is 
the two-point correlation function for polarization. The polarization signal is very 
weak: it is expected to be only 10% of the total temperature fluctuations on small 
angular scales, decreasing to much less than 1% on large angular scales. Hence, as 
difficult as it is to detect the temperature fluctuations, detecting the polarization is 
an even more extraordinary experimental challenge. Nevertheless, experimentalists 
are up to the challenge and the prospects seem to be excellent. 


9.10.1 Polarization tensor 


The electric field $\mathbf{E}$ is always transverse to the direction of propagation of the
electromagnetic wave, characterized by the unit vector $\mathbf{n}$. Therefore, this field can be
decomposed as $\mathbf{E} = E^a \mathbf{e}_a$, where $a = 1, 2$ and $\mathbf{e}_a$ are two linearly independent basis
vectors perpendicular to $\mathbf{n}$ (Figure 9.4). Completely (linearly) polarized light always
has the vector $\mathbf{E}$ aligned in a definite direction, while in the opposite case of completely
unpolarized radiation, all directions of $\mathbf{E}$ perpendicular to $\mathbf{n}$ are equally probable. In
the absence of circular polarization, the polarization properties of the radiation can be
characterized entirely by the two-dimensional, second rank symmetric polarization
tensor

$$P_{ab} \equiv \frac{1}{I}\left(\langle E_a E_b\rangle - \frac{1}{2}\langle E_c E^c\rangle\, g_{ab}\right), \tag{9.134}$$

where the metric tensor $g_{ab} = \mathbf{e}_a \cdot \mathbf{e}_b$ and its inverse are used to raise and lower the
indices, e.g. $E_a = g_{ab} E^b$. The brackets represent an average over a time interval
much exceeding the typical inverse frequencies of the wave. The scalar product of
two three-dimensional vectors $\mathbf{e}_a$ is defined, as usual, with respect to the spatial
Euclidean metric. The overall intensity of the radiation is proportional to $I \equiv \langle E_c E^c\rangle$.
If the light is polarized, then the brightness temperature of the radiation after it
passes through the polarizer depends on its orientation $\mathbf{m} = m^a \mathbf{e}_a$ and the temperature
variations are $\delta T(\mathbf{m}) \propto P_{ab}\, m^a m^b$. Thus, by measuring this dependence, one
can determine the polarization tensor $P_{ab}$.

Fig. 9.4. [Diagram not reproduced: the polarization basis vectors $\mathbf{e}_1$ and $\mathbf{e}_2$.]


Problem 9.18 Calculate the polarization tensor and the fraction of polarization

$$P \equiv -4\det|P_a^b| \tag{9.135}$$

in two extreme cases: completely polarized and completely unpolarized radiation.


Let us assume that $P \neq 0$ and consider the eigenvalue problem for the matrix
$P_a^b$:

$$P_a^b\, p_b = \lambda\, p_a \tag{9.136}$$

with positive eigenvalue $\lambda$. Normalizing an eigenvector $p_a$ in such a way that
$p^2 \equiv p_c p^c = 2\lambda$, we can express the polarization tensor $P_{ab}$ through the polarization
vector $p_a$ as

$$P_{ab} = p_a p_b - \tfrac{1}{2}\, p^2 g_{ab}. \tag{9.137}$$

In fact, one can see that for $P_{ab}$ given by (9.137), the vector $p_a$ is the solution
of (9.136) with $\lambda = p^2/2$. The other independent solution of (9.136) is a vector
$f_a$ perpendicular to $p_a$, that is, $f_a p^a = 0$. Using the orthogonality condition, we
immediately obtain from (9.137) that the appropriate eigenvalue is negative and
equal to $-p^2/2$, in complete agreement with the fact that the polarization tensor
is traceless: $P_a^a = 0$. The fraction of polarization can be expressed through the
magnitude of the polarization vector as

$$P = -4\det|P_a^b| = p^4. \tag{9.138}$$

In the orthonormal basis, where $\mathbf{e}_a \cdot \mathbf{e}_b = \delta_{ab}$, one can define the Stokes
parameters

$$Q \equiv 2I P_{11}, \qquad U \equiv -2I P_{12}. \tag{9.139}$$

Every direction on the sky $\mathbf{n}$ can be entirely characterized by the polar coordinates
$\theta$ and $\varphi$, in terms of which the metric induced on the celestial sphere of unit radius
is

$$ds^2 = g_{ab}\, dx^a dx^b = d\theta^2 + \sin^2\theta\, d\varphi^2. \tag{9.140}$$

In this case, it is convenient to use as a pair of polarization basis vectors $\mathbf{e}_a$ the
coordinate basis vectors $\mathbf{e}_\theta$ and $\mathbf{e}_\varphi$, tangential to the coordinate lines $\varphi = \mathrm{const}$ and
$\theta = \mathrm{const}$ respectively. Considering the appropriate orthonormal vectors $\mathbf{e}_\theta$ and
$\hat{\mathbf{e}}_\varphi = \mathbf{e}_\varphi / |\mathbf{e}_\varphi|$, the Stokes parameters are

$$Q = 2I P_{\theta\theta} = -2I P_{\hat\varphi\hat\varphi}, \qquad U = -2I P_{\theta\hat\varphi}.$$
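The definitions (9.134) to (9.138) can be checked numerically. The following is a minimal sketch, assuming an orthonormal basis (so that $g_{ab}$ is the identity) and a time average replaced by an average over a finite set of field samples; the function names and the three sample beams are hypothetical and serve only as an illustration.

```python
import numpy as np

def polarization_tensor(E):
    """Return (P_ab, I) for field samples E of shape (N, 2), following (9.134).

    Assumes an orthonormal basis, so g_ab is the 2x2 identity matrix.
    """
    EE = E.T @ E / len(E)                 # sample average <E_a E_b>
    I = np.trace(EE)                      # I = <E_c E^c>
    return (EE - 0.5 * I * np.eye(2)) / I, I

def polarization_fraction(P):
    """Fraction of polarization P = -4 det|P_a^b|, as in (9.135)."""
    return -4.0 * np.linalg.det(P)

def polarization_vector(P):
    """Eigenvector with the positive eigenvalue, normalized so that p^2 = 2*lambda."""
    lam, vecs = np.linalg.eigh(P)         # eigenvalues in ascending order
    return np.sqrt(2.0 * max(lam[1], 0.0)) * vecs[:, 1]

alpha = np.linspace(0.0, 2.0 * np.pi, 360, endpoint=False)
unpolarized = np.column_stack([np.cos(alpha), np.sin(alpha)])        # all directions equally likely
polarized = np.column_stack([np.cos(alpha), np.zeros_like(alpha)])   # E always along e_1
mixed = np.vstack([unpolarized, 2.0 * polarized])                    # hypothetical partially polarized beam

for name, E in [("unpolarized", unpolarized), ("polarized", polarized), ("mixed", mixed)]:
    P, I = polarization_tensor(E)
    p = polarization_vector(P)
    # The last two numbers should agree: -4 det P versus p^4, as in (9.138).
    print(name, polarization_fraction(P), (p @ p) ** 2)
```

For the completely polarized sample the fraction is 1, for the unpolarized one it is 0 up to rounding, and for the mixed sample the tensor is recovered from its polarization vector through (9.137).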


Problem 9.19 Write down in the original basis $\mathbf{e}_\theta$, $\mathbf{e}_\varphi$ the covariant, contravariant
and mixed components of the polarization tensor in terms of these Stokes parameters.


The reader may question why we work with the polarization tensor and not simply
with the polarization vector. The point is that the polarization tensor multiplied by $I$,
and consequently the Stokes parameters, are additive for incoherent superposition
of waves and can easily be calculated. The polarization vector is not additive, but
nevertheless the physical interpretation of the polarization pattern is clearest in
terms of this vector. In particular, if the radiation is completely polarized, the
polarization vector is aligned along the electric field. Note that only the orientation
of $p_a$, and not its direction, has a physical meaning because the polarization tensor
is quadratic in $p_a$. For partially polarized radiation, the polarization vector points
in the direction of the electric field of the waves which dominate in overall flux, and
the magnitude of $p_a$ characterizes the excess of the waves with appropriate polarization.
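The additivity argument can be made concrete with a small numerical sketch. It assumes an orthonormal basis, builds each beam from (9.137), and reads the Stokes sign convention as $Q = 2IP_{11}$, $U = -2IP_{12}$; the helper names `beam`, `superpose` and `stokes` are hypothetical.

```python
import numpy as np

def beam(I, p2, psi):
    """Return (I, I*P_ab) for a beam of intensity I, squared polarization
    magnitude p2 = p^2 and polarization angle psi (hypothetical helper)."""
    p = np.sqrt(p2) * np.array([np.cos(psi), np.sin(psi)])
    P = np.outer(p, p) - 0.5 * p2 * np.eye(2)     # (9.137) with g_ab = delta_ab
    return I, I * P

def superpose(*beams):
    """Incoherent sum: the intensities and the products I*P_ab add."""
    I_tot = sum(I for I, _ in beams)
    IP_tot = sum(IP for _, IP in beams)
    return I_tot, IP_tot / I_tot

def stokes(I, P):
    """Stokes parameters, assuming the sign convention Q = 2 I P_11, U = -2 I P_12."""
    return 2.0 * I * P[0, 0], -2.0 * I * P[0, 1]

# Two equal, completely polarized beams with perpendicular polarizations:
# adding the polarization vectors would give a vector at 45 degrees, whereas
# adding I*P_ab correctly gives unpolarized light.
I, P = superpose(beam(1.0, 1.0, 0.0), beam(1.0, 1.0, np.pi / 2))
print(I, P, stokes(I, P))
```

The design point is that only quantities quadratic in the field are summed, which is why the tensor (or equivalently $Q$ and $U$), rather than the polarization vector, is the natural bookkeeping device.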


9.10.2 Thomson scattering and polarization 


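Let us consider a linearly polarized electromagnetic wave with electric field $\mathbf{E}$
scattered by an electron in the direction $\mathbf{n}$ (see Figure 9.5). After scattering, the
wave remains completely polarized and its electric field is

$$\tilde{\mathbf{E}} = A\,\big((\mathbf{E} \times \mathbf{n}) \times \mathbf{n}\big), \tag{9.141}$$

where the coefficient $A$ does not depend on $\mathbf{E}$ and $\mathbf{n}$. Taking into account that the
polarization basis vectors $\mathbf{e}_a$ are orthogonal to $\mathbf{n}$, and that
$(\mathbf{E} \times \mathbf{n}) \times \mathbf{n} = (\mathbf{E}\cdot\mathbf{n})\,\mathbf{n} - \mathbf{E}$, we find that, after scattering, the
components of the electric field along the vectors $\mathbf{e}_a$ are

$$\tilde{E}_a = \tilde{\mathbf{E}} \cdot \mathbf{e}_a = -A\, \mathbf{E} \cdot \mathbf{e}_a. \tag{9.142}$$

The overall sign is immaterial in what follows, since only $A^2$ enters.

Fig. 9.5.

If incoming light arriving from direction $\mathbf{l}$ is completely unpolarized, then the
resulting polarization tensor can be calculated using (9.142) and averaging over all
directions of $\mathbf{E}$ perpendicular to $\mathbf{l}$.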


Problem 9.20 Show that in this case

$$\langle \tilde{E}_a \tilde{E}_b \rangle = \tfrac{1}{2} A^2 \langle E^2 \rangle \big( g_{ab} - (\mathbf{l}\cdot\mathbf{e}_a)(\mathbf{l}\cdot\mathbf{e}_b) \big), \tag{9.143}$$

and

$$\tilde{I} = \langle \tilde{E}_a \tilde{E}^a \rangle = \tfrac{1}{2} A^2 \langle E^2 \rangle \big( 1 + (\mathbf{l}\cdot\mathbf{n})^2 \big), \tag{9.144}$$

where $\langle E^2 \rangle$ is the average of the squared electric field in the incident unpolarized
beam. (Hint: Justify and use the following formula for averaging over directions of
the electric field in the incident beam:

$$\langle E^i E^j \rangle = \tfrac{1}{2} \langle E^2 \rangle \big( \delta^{ij} - l^i l^j \big), \tag{9.145}$$

where $E^i$, $l^i$ ($i = 1, 2, 3$) are the components of the appropriate 3-vectors in some
orthonormal basis.)

Write down the polarization tensor and verify that $f_a = (\mathbf{l}\cdot\mathbf{e}_a)$ is an eigenvector
of $P_a^b$ with negative eigenvalue. Show that the polarization vector $p^a$ is the vector
perpendicular to $f_a$ with norm

$$p^2 = \frac{1 - (\mathbf{l}\cdot\mathbf{n})^2}{1 + (\mathbf{l}\cdot\mathbf{n})^2}. \tag{9.146}$$

It follows from (9.146) that the incoming unpolarized radiation scattered at right
angles ($\mathbf{l}\cdot\mathbf{n} = 0$) comes out completely polarized in the direction perpendicular
to the plane containing the vectors $\mathbf{l}$ and $\mathbf{n}$ as, for instance, the sunlight from the
horizon is linearly polarized parallel to the horizon at midday. 
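The results (9.143), (9.144) and (9.146) can be verified by brute-force averaging. The sketch below is an illustration under stated assumptions: the incident field has unit amplitude (so $\langle E^2\rangle = 1$), the average over polarization directions is a uniform sum over angles, and the helper names are hypothetical.

```python
import numpy as np

def perpendicular_pair(v):
    """Two orthonormal vectors spanning the plane perpendicular to the unit vector v."""
    helper = np.array([1.0, 0.0, 0.0]) if abs(v[0]) < 0.9 else np.array([0.0, 1.0, 0.0])
    e1 = np.cross(v, helper)
    e1 /= np.linalg.norm(e1)
    return e1, np.cross(v, e1)

def scatter_unpolarized(l, n, A=1.0, n_angles=720):
    """Average the scattered field over all incident E directions perpendicular to l."""
    l = np.asarray(l, float) / np.linalg.norm(l)
    n = np.asarray(n, float) / np.linalg.norm(n)
    u, v = perpendicular_pair(l)
    e = np.array(perpendicular_pair(n))            # rows e_1, e_2; orthonormal, so g_ab = delta_ab
    alpha = np.linspace(0.0, 2.0 * np.pi, n_angles, endpoint=False)
    E = np.outer(np.cos(alpha), u) + np.outer(np.sin(alpha), v)   # unit incident fields
    E_sc = A * np.cross(np.cross(E, n), n)         # scattered field, as in (9.141)
    comps = E_sc @ e.T                             # components along e_a
    EE = comps.T @ comps / n_angles                # <E~_a E~_b>
    I_sc = np.trace(EE)
    P = (EE - 0.5 * I_sc * np.eye(2)) / I_sc       # polarization tensor (9.134)
    return l, n, e, EE, I_sc, P

l, n, e, EE, I_sc, P = scatter_unpolarized([0.3, -0.5, 0.8], [0.0, 0.0, 1.0], A=2.0)
c = l @ n                                          # l . n
f = e @ l                                          # f_a = l . e_a
print(np.allclose(EE, 0.5 * 4.0 * (np.eye(2) - np.outer(f, f))))      # check of (9.143)
print(np.isclose(I_sc, 0.5 * 4.0 * (1.0 + c**2)))                     # check of (9.144)
print(np.isclose(2.0 * np.linalg.eigvalsh(P)[1], (1 - c**2) / (1 + c**2)))  # check of (9.146)
```

Because the code applies the double cross product directly, the sign of the projected components plays no role; only quadratic combinations are compared.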

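Now, if we generalize to the case of an incoming unpolarized radiation field with
intensity $J(\mathbf{l}) \propto \langle E^2(\mathbf{l}) \rangle$ and take into account that $\langle \tilde{E}_a \tilde{E}_b \rangle$ and $\tilde{I}$ are additive for
incoherent light, we obtain

$$P_{ab}(\mathbf{n}) = \frac{\displaystyle\int \left[ \tfrac{1}{2}\, g_{ab} \big( 1 - (\mathbf{l}\cdot\mathbf{n})^2 \big) - (\mathbf{l}\cdot\mathbf{e}_a)(\mathbf{l}\cdot\mathbf{e}_b) \right] J(\mathbf{l})\, d^2\mathbf{l}}{\displaystyle\int \left[ 1 + (\mathbf{l}\cdot\mathbf{n})^2 \right] J(\mathbf{l})\, d^2\mathbf{l}}. \tag{9.147}$$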


If the incoming radiation is isotropic ($J(\mathbf{l}) = \mathrm{const}$), then integrating over $\mathbf{l}$, we
find that $P_{ab}(\mathbf{n}) = 0$, that is, the scattered radiation remains unpolarized. The
expression inside the integrand is quadratic in $\mathbf{l}$ and can be expressed in terms of
the quadrupole spherical harmonics. Thus, scattering induces polarization in
initially unpolarized radiation only if the incident radiation is anisotropic and only the
quadrupole component of the anisotropy contributes to the polarization. 
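A direct numerical integration of (9.147) illustrates this statement. The sketch assumes $\mathbf{n}$ along the z-axis with $\mathbf{e}_1$, $\mathbf{e}_2$ along x and y, and uses Gauss-Legendre nodes in $\cos\theta$ with a uniform grid in $\varphi$; the function name and the three test intensities are hypothetical.

```python
import numpy as np

def polarization_from_intensity(J, n_mu=16, n_phi=32):
    """Evaluate (9.147) for n = z, e_1 = x, e_2 = y and an incident intensity J(l).

    J takes an array l of shape (3, n_mu, n_phi) holding unit vectors.
    """
    mu, w_mu = np.polynomial.legendre.leggauss(n_mu)      # nodes and weights in cos(theta)
    phi = np.linspace(0.0, 2.0 * np.pi, n_phi, endpoint=False)
    MU, PHI = np.meshgrid(mu, phi, indexing="ij")
    W = w_mu[:, None] * (2.0 * np.pi / n_phi) * np.ones_like(PHI)   # solid-angle weights
    s = np.sqrt(1.0 - MU**2)
    l = np.stack([s * np.cos(PHI), s * np.sin(PHI), MU])
    Jv = J(l)
    num = np.empty((2, 2))
    for a in range(2):
        for b in range(2):
            integrand = 0.5 * (a == b) * (1.0 - l[2]**2) - l[a] * l[b]
            num[a, b] = np.sum(W * integrand * Jv)
    den = np.sum(W * (1.0 + l[2]**2) * Jv)
    return num / den

eps = 0.1
isotropic = lambda l: np.ones_like(l[0])
dipole = lambda l: 1.0 + 0.3 * l[0]
quadrupole = lambda l: 1.0 + eps * l[0]**2        # excess intensity arriving along +-x

print(polarization_from_intensity(isotropic))     # vanishes up to rounding
print(polarization_from_intensity(dipole))        # vanishes up to rounding
print(polarization_from_intensity(quadrupole))    # P_11 = -P_22 is nonzero
```

For the quadrupole example the angular integrals can be done by hand and give $P_{11} = -P_{22} = -\epsilon/(20 + 6\epsilon)$, so the scattered light is polarized along $\mathbf{e}_2$, perpendicular to the direction of excess incident intensity.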


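Problem 9.21 If the vectors $\mathbf{e}_a$ are orthonormal, then the triplet $\mathbf{e}_1$, $\mathbf{e}_2$ and $\mathbf{n}$ forms
an orthonormal basis in three-dimensional space and one can specify the direction
of the incident vector $\mathbf{l}$ by the Euler angles $\theta$ and $\varphi$. Verify that in this case the
polarization tensor components given in (9.147) are

$$P_{11} = -P_{22} = -\frac{1}{\tilde{I}} \sqrt{\frac{3}{40}} \int \mathrm{Re}\, Y_{22}\, J\, d\Omega, \qquad
P_{12} = P_{21} = -\frac{1}{\tilde{I}} \sqrt{\frac{3}{40}} \int \mathrm{Im}\, Y_{22}\, J\, d\Omega, \tag{9.148}$$

where

$$\tilde{I} \equiv \int \left( Y_{00} + \frac{1}{2\sqrt{5}}\, Y_{20} \right) J\, d\Omega, \tag{9.149}$$

and $Y_{lm}(\theta, \varphi)$ are the appropriate spherical functions.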


9.10.3 Delayed recombination and polarization 


Before recombination begins, the radiation field has only dipole anisotropy (see
(9.31)) and therefore polarization cannot be generated. If recombination were
instantaneous, then after recombination the photons would propagate without further
scattering and no polarization would be generated. Therefore, the background
radiation becomes polarized only if recombination is delayed. Taking into account
that at conformal time $\eta_L$ the probability of last scattering is given by (9.47) and

$$J(\eta_L, \mathbf{l}) \propto \big( T_0 + \delta T(\eta_L, \mathbf{l}) \big)^4, \tag{9.150}$$

we obtain (to leading order) from (9.147):

$$P_{ab}(\mathbf{n}) = 3 \int \left[ \tfrac{1}{2}\, g_{ab} \big( 1 - (\mathbf{l}\cdot\mathbf{n})^2 \big) - (\mathbf{l}\cdot\mathbf{e}_a)(\mathbf{l}\cdot\mathbf{e}_b) \right]
\frac{\delta T}{T_0}(\eta_L, \mathbf{l})\, \mu'(\eta_L)\, e^{-\mu(\eta_L)}\, d\eta_L\, \frac{d^2\mathbf{l}}{4\pi}. \tag{9.151}$$
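As a consistency check, the prefactor 3 in (9.151) can be recovered from (9.147) and (9.150) at a fixed scattering time. This is a sketch of the leading-order expansion only; the weighting by the visibility function $\mu'(\eta_L)e^{-\mu(\eta_L)}$ is then applied on top of it, and $[\dots]$ abbreviates the square bracket in the numerator of (9.147).

```latex
\begin{aligned}
J(\mathbf{l}) &\propto (T_0 + \delta T)^4 \simeq T_0^4 \left( 1 + 4\,\frac{\delta T}{T_0} \right), \\
\int \left[ \tfrac{1}{2}\, g_{ab} \big( 1 - (\mathbf{l}\cdot\mathbf{n})^2 \big)
  - (\mathbf{l}\cdot\mathbf{e}_a)(\mathbf{l}\cdot\mathbf{e}_b) \right] d^2\mathbf{l} &= 0
  \quad \text{(the isotropic part of $J$ drops out of the numerator)}, \\
\int \left[ 1 + (\mathbf{l}\cdot\mathbf{n})^2 \right] d^2\mathbf{l}
  &= 4\pi \left( 1 + \tfrac{1}{3} \right) = \frac{16\pi}{3}
  \quad \text{(denominator to zeroth order in $\delta T$)}, \\
P_{ab} &\simeq \frac{3}{16\pi} \int [\dots]\; 4\,\frac{\delta T}{T_0}\, d^2\mathbf{l}
  = 3 \int [\dots]\, \frac{\delta T}{T_0}\, \frac{d^2\mathbf{l}}{4\pi}.
\end{aligned}
```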


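Hence, the polarization should be proportional to the quadrupole temperature
fluctuations generated during the delayed recombination. To calculate $\delta T/T_0(\eta_L, \mathbf{l})$ at
the point of scattering $\mathbf{x}$, resulting from the scalar metric perturbations, we can use
(9.49) together with (9.48), where one has to replace $\eta_0$ by $\eta_L$ and integrate over
the time interval $\eta_L > \eta'_L > 0$, that is,

$$\frac{\delta T}{T_0}(\eta_L, \mathbf{l}) = \int \frac{d^3k}{(2\pi)^{3/2}} \int_0^{\eta_L}
\left( \Phi + \frac{\delta}{4} - \frac{3\delta'}{4k^2} \frac{\partial}{\partial \eta_L} \right)_{\eta'_L}
e^{i\mathbf{k}\cdot[\mathbf{x} + \mathbf{l}(\eta'_L - \eta_L)]}\,
\mu'(\eta'_L)\, e^{-\mu(\eta'_L)}\, d\eta'_L. \tag{9.152}$$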
Because we will be content with only a rough estimate of the expected polarization
we note that the visibility function $\mu'(\eta_L)\, e^{-\mu(\eta_L)}$ in (9.151) has a sharp maximum
at $\eta_L = \eta_r$ corresponding to $z_r \simeq 1050$. Thus the polarization should be about
the quadrupole temperature fluctuation at this time. As we have seen, the main
contribution to the quadrupole component comes from perturbations with the scales
comparable to the horizon scale, that is, with $k\eta_r \sim 1$. One can get an idea of the
amplitude of this component by noting that the quadrupole is proportional to terms
quadratic in $\mathbf{l}$ arising from the expansion of

$$\exp\left[ i\mathbf{k}\cdot\mathbf{l}\,(\eta'_L - \eta_L) \right] \sim \exp\left[ i\mathbf{k}\cdot\mathbf{l}\,(\eta'_L - \eta_r) \right]$$

in powers of $\mathbf{l}$ in (9.152) (note that the higher multipoles also contain terms quadratic
in $\mathbf{l}$). Because the visibility function has a sharp peak of width $\Delta\eta \sim \sigma\eta_r$ (see (9.55),
(9.57) and (9.60)), we can estimate $\mathbf{k}\cdot\mathbf{l}\,(\eta'_L - \eta_r)$ as $\sigma$ for $k\eta_r \sim 1$. Therefore,
the quadrupole components at $\eta_r$, and hence the expected polarization, should be
about $O(1)\sigma \sim 10^{-2}$-$10^{-1}$ times the temperature fluctuations observed today on
angular scales corresponding to the recombination horizon. Polarization, then, is
proportional to the duration of recombination and vanishes if recombination is
instantaneous. Numerical calculations show that polarization never exceeds 10%
of the temperature fluctuations on any angular scales.


9.10.4 E and B polarization modes and correlation functions


To analyze the field of temperature fluctuations we computed the temperature
autocorrelation function. The polarization induced at the last scattering surface
is characterized by the tensor field $P_{ab}(\mathbf{n})$ on the celestial sphere. The induced
polarization is correlated at different points on the sphere and, as with the
temperature fluctuation field, can be characterized by the correlation functions
$\langle P_{ab}(\mathbf{n}_1)\, P_{cd}(\mathbf{n}_2) \rangle$.

The symmetric traceless tensor $P_{ab}(\mathbf{n})$ has two independent components and
therefore, instead of $P_{ab}(\mathbf{n})$ itself, it is more convenient to consider two independent
scalar functions built out of the polarization tensor:

$$E(\mathbf{n}) \equiv P_{ab}{}^{;ab}, \qquad B(\mathbf{n}) \equiv P_{ab}{}^{;bc}\, \epsilon_{c}{}^{a}, \tag{9.153}$$

where ; denotes the covariant derivative on the two-dimensional sphere with metric
(9.140) and

$$\epsilon_{ab} \equiv \sqrt{g} \begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix} \tag{9.154}$$

is the two-dimensional skew-symmetric Levi-Civita "tensor." It behaves as a tensor
only under coordinate transformations with positive Jacobian. Under reflections,
$\epsilon_{ab}$ changes sign. Therefore, only the E mode of polarization is a scalar, while the
B mode is a pseudo-scalar, reminiscent of an electric (E) and magnetic (B) field
respectively.

The most important thing is that the B mode is not generated by scalar perturbations.
To prove this let us consider the polarization induced by inhomogeneity with
wavenumber $\mathbf{k}$. We use the particular spherical coordinate system where the z-axis
determining the Euler angle $\theta$ coincides with the direction $\mathbf{k}$. In this coordinate
system, the direction of observation $\mathbf{n}$ is characterized by the polar angles $\theta$ and $\varphi$, and
$\mathbf{k}\cdot\mathbf{n} = k\cos\theta$. For every $\mathbf{n}$ we can use the orthogonal coordinate vectors $\mathbf{e}_\theta(\mathbf{n})$ and
$\mathbf{e}_\varphi(\mathbf{n})$, tangential to the coordinate lines on the celestial sphere, as the polarization
basis. As is clear from (9.152), the $\mathbf{l}$-dependence of the temperature of the incident
radiation appears only in the combination $\mathbf{k}\cdot\mathbf{l}$. Therefore, from (9.151) we may
easily infer that the nondiagonal component of the polarization tensor should be
proportional to

$$P_{\theta\varphi}(\theta, \varphi) \propto (\mathbf{k}\cdot\mathbf{e}_\theta)(\mathbf{k}\cdot\mathbf{e}_\varphi). \tag{9.155}$$

This component vanishes because in our coordinate system the vector $\mathbf{e}_\varphi(\mathbf{n})$ is
transverse to $\mathbf{k}$ at every point on the sphere. The diagonal components of $P_{ab}$ can
depend only on $\mathbf{k}\cdot\mathbf{e}_\theta$, $\mathbf{k}\cdot\mathbf{n}$ and the metric (all of which are $\varphi$-independent) and
therefore, the general form of the polarization tensor is

$$P_{ab}(\theta, \varphi) = \begin{pmatrix} Q(\theta) & 0 \\ 0 & -Q(\theta)\sin^2\theta \end{pmatrix}, \tag{9.156}$$

where we have taken into account that $P_a^a = 0$.


Problem 9.22 Calculate $E$ and $B$ for the polarization tensor given by (9.156) and
verify that $B = 0$ in this case.


Because $B(\theta, \varphi)$ is a pseudo-scalar function, it does not depend on the coordinate
system used to calculate it and vanishes for every mode of the density perturbations.
Thus density perturbations generate only E mode polarization, which describes the
component of the polarization with even parity (the scalar perturbation with given
$\mathbf{k}$ is symmetric with respect to rotations around $\mathbf{k}$ and therefore has no handedness).
It is easy to see that for $P_{ab}$ given by (9.156) the appropriate polarization vectors
are proportional to $\mathbf{e}_\theta$ at every point where $Q(\theta) > 0$ and proportional to $\mathbf{e}_\varphi$ for
$Q(\theta) < 0$. Therefore, in terms of polarization patterns, the E mode produces
arrangements of polarization vectors which are oriented radially or tangentially to
the circles with respect to the density perturbations, respecting their axial symmetry,
as illustrated in Figure 9.6. 
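The vanishing of $B$ for the axially symmetric tensor (9.156) can be confirmed symbolically. The sketch below assumes the metric (9.140), takes $E$ as the double covariant divergence of $P^{ab}$ and $B$ as the curl of the vector $V_a = P_{ab}{}^{;b}$ (which fixes $B$ only up to the sign convention chosen for $\epsilon_{ab}$), and uses SymPy; the function names and the purely off-diagonal comparison tensor are hypothetical.

```python
import sympy as sp

theta, phi = sp.symbols("theta phi", positive=True)
x = (theta, phi)
g = sp.diag(1, sp.sin(theta)**2)          # metric (9.140)
ginv = g.inv()
sqrtg = sp.sin(theta)                     # sqrt(det g), valid for 0 < theta < pi

def christoffel(a, b, c):
    """Christoffel symbol Gamma^a_{bc} of the unit sphere."""
    return sum(ginv[a, d] * (sp.diff(g[d, b], x[c]) + sp.diff(g[d, c], x[b])
                             - sp.diff(g[b, c], x[d])) for d in range(2)) / 2

def E_and_B(P_low):
    """E = P_ab^{;ab} and B = eps^{cd} V_{d;c} with V_a = P_ab^{;b}, for covariant P_ab."""
    P_up = ginv * P_low * ginv
    V_up = []
    for a in range(2):
        term = sum(sp.diff(P_up[a, b], x[b]) for b in range(2))
        term += sum(christoffel(a, b, c) * P_up[c, b] for b in range(2) for c in range(2))
        term += sum(christoffel(b, b, c) * P_up[a, c] for b in range(2) for c in range(2))
        V_up.append(term)                 # V^a = covariant divergence of P^{ab}
    V_low = [sum(g[a, b] * V_up[b] for b in range(2)) for a in range(2)]
    E = sum(sp.diff(sqrtg * V_up[a], x[a]) for a in range(2)) / sqrtg
    B = (sp.diff(V_low[1], theta) - sp.diff(V_low[0], phi)) / sqrtg
    return sp.simplify(E), sp.simplify(B)

Q = sp.Function("Q")(theta)
P_scalar = sp.Matrix([[Q, 0], [0, -Q * sp.sin(theta)**2]])        # form (9.156)
P_offdiag = sp.Matrix([[0, sp.sin(theta)], [sp.sin(theta), 0]])   # hypothetical off-diagonal example

print(E_and_B(P_scalar))    # B vanishes identically for any Q(theta)
print(E_and_B(P_offdiag))   # B is nonzero for this off-diagonal tensor
```

The second example is included only for contrast: a traceless tensor with a $\varphi$-independent off-diagonal component generically has $B \neq 0$, which is the situation that arises for gravitational waves.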

In contrast with scalar perturbations, gravitational waves also generate B mode
polarization. To see this, let us consider a gravitational wave with wavenumber $\mathbf{k}$
in a coordinate system where the z-axis is aligned along $\mathbf{k}$. Taking into account the
general structure of the temperature fluctuations induced by gravitational waves
(see (9.119), (9.120)), we can infer from (9.151) that the nondiagonal component
of the polarization tensor is proportional to

$$P_{\theta\varphi} \propto e_{ik}\, (\mathbf{e}_\theta)^i (\mathbf{e}_\varphi)^k, \tag{9.157}$$

where $e_{ik}$ is the polarization tensor of the gravitational waves.

Fig. 9.6.

Problem 9.23 Using (9.121), verify that after averaging over the random polarizations
$e_{ik}$, the component $\langle P_{\theta\varphi}^2 \rangle$ does not vanish and depends only on $\theta$. Calculate
the B mode polarization in the case of nondiagonal $P_{ab}(\theta)$ and show that it is
generically different from zero.


The polarization vector $p_a$ induced by the gravitational wave is a linear combination
of $\mathbf{e}_\theta$ and $\mathbf{e}_\varphi$. Therefore, the polarization vectors are oriented in circulating
patterns, as illustrated in Figure 9.6. In this case the B mode, which has odd parity
(handedness), does not vanish. This is due to the fact that the gravitational wave
is not symmetric with respect to the rotations around $\mathbf{k}$. Hence, the gravitational
waves present at recombination can be detected indirectly via the B mode of the
CMB polarization.

To characterize the polarization field on today's sky one can use the appropriate
correlation functions, for instance,

$$C^{ET}(\theta) \equiv \left\langle E(\mathbf{n}_1)\, \frac{\delta T}{T_0}(\mathbf{n}_2) \right\rangle, \tag{9.158}$$

where the averaging is performed over all directions on the sky satisfying the
condition $\mathbf{n}_1 \cdot \mathbf{n}_2 = \cos\theta$. The other correlation functions are $C^{BT}$, $C^{EE}$, $C^{BB}$ and
$C^{EB}$. As in the case of temperature fluctuations, the polarizations $E(\mathbf{n})$ and $B(\mathbf{n})$
can be expanded in terms of the scalar spherical harmonics:

$$E = \sum_{l,m} a^E_{lm}\, Y_{lm}(\theta, \varphi), \qquad B = \sum_{l,m} a^B_{lm}\, Y_{lm}(\theta, \varphi). \tag{9.159}$$


Since we directly measure the polarization tensor itself, however, it is not very practical
to take second derivatives of the experimental data to calculate the coefficients
$a_{lm}$. Instead we note that

$$a^E_{lm} = \int E(\mathbf{n})\, Y^*_{lm}(\mathbf{n})\, d^2\mathbf{n} = \int P_{ab}\, Y^{*\,;ab}_{lm}\, d^2\mathbf{n}
= \frac{1}{N_l} \int P_{ab}\, Y^{E\,ab\,*}_{lm}\, d^2\mathbf{n}, \tag{9.160}$$

where

$$N_l \equiv \sqrt{\frac{2\,(l-2)!}{(l+2)!}}$$

and

$$Y^E_{lm(ab)} \equiv N_l \left( Y_{lm;ab} - \tfrac{1}{2}\, g_{ab}\, Y_{lm;c}{}^{c} \right)$$

are the E-type tensor harmonics which obey orthogonality relations analogous to
the scalar spherical harmonics:

$$\int Y^{E\,*}_{lm(ab)}\, Y^{E\,ab}_{l'm'}\, d^2\mathbf{n} = \delta_{ll'}\, \delta_{mm'}. \tag{9.161}$$

In deriving (9.160) we integrated twice by parts and took into account that the
polarization tensor is traceless. 
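The two steps mentioned here can be written out explicitly. The following sketch assumes only that the sphere has no boundary (so the surface terms vanish) and that $P_{ab}\, g^{ab} = 0$; it uses the definitions of $N_l$ and $Y^E_{lm(ab)}$ given with (9.160).

```latex
\begin{aligned}
a^E_{lm} &= \int P_{ab}{}^{;ab}\, Y^*_{lm}\, d^2\mathbf{n}
          = -\int P_{ab}{}^{;a}\, Y^{*\,;b}_{lm}\, d^2\mathbf{n}
          = \int P_{ab}\, Y^{*\,;ab}_{lm}\, d^2\mathbf{n}
          && \text{(two integrations by parts)} \\
         &= \int P_{ab} \left( Y^{*\,;ab}_{lm} - \tfrac{1}{2}\, g^{ab}\, Y^{*}_{lm;c}{}^{c} \right) d^2\mathbf{n}
          && \text{(since } P_{ab}\, g^{ab} = 0 \text{)} \\
         &= \frac{1}{N_l} \int P_{ab}\, Y^{E\,ab\,*}_{lm}\, d^2\mathbf{n}.
\end{aligned}
```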
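Likewise, we have

$$a^B_{lm} = \frac{1}{N_l} \int P_{ab}\, Y^{B\,ab\,*}_{lm}\, d^2\mathbf{n} \tag{9.162}$$

in terms of the normalized B-type tensor harmonics

$$Y^B_{lm(ab)} \equiv \frac{N_l}{2} \left( Y_{lm;ac}\, \epsilon^{c}{}_{b} + Y_{lm;bc}\, \epsilon^{c}{}_{a} \right). \tag{9.163}$$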


Note that the E- and B-type tensor harmonics only exist for $l > 1$ and, taken
together, form a complete orthonormal basis for second rank tensors on the sphere.
Therefore, the polarization tensor can be expanded as

$$P_{ab} = \sum_{l,m} \left[ \tilde{a}^E_{lm}\, Y^E_{lm(ab)} + \tilde{a}^B_{lm}\, Y^B_{lm(ab)} \right], \tag{9.164}$$

where, as follows from (9.160) and (9.162), $\tilde{a}^{E,B}_{lm} = N_l\, a^{E,B}_{lm}$. Thus, instead of first
calculating the second derivative of the polarization tensor and then expanding it
in terms of the scalar spherical harmonics, we can simply expand the polarization
tensor itself in terms of the tensor harmonics. Then, in addition to the usual
$C_l \equiv \langle a^*_{lm} a_{lm} \rangle$ characterizing the temperature fluctuations, the polarization of the CMB
fluctuations can be described in terms of the sequence of multipoles:

$$\begin{aligned}
C_l^{EE} &\equiv \langle a^{E*}_{lm} a^{E}_{lm} \rangle, \quad
C_l^{BB} \equiv \langle a^{B*}_{lm} a^{B}_{lm} \rangle, \quad
C_l^{EB} \equiv \langle a^{E*}_{lm} a^{B}_{lm} \rangle, \\
C_l^{ET} &\equiv \langle a^{E*}_{lm} a_{lm} \rangle, \quad
C_l^{BT} \equiv \langle a^{B*}_{lm} a_{lm} \rangle.
\end{aligned} \tag{9.165}$$

Problem 9.24 Find the explicit expressions for the tensor spherical harmonics in 
terms of the usual scalar spherical harmonics. 


Although the tensor harmonics are technically more complicated than the scalar
harmonics, the point is that, given the orthogonality relations, the analysis of the
polarization correlation functions is exactly parallel to that of the correlation function
for the temperature fluctuations. In Figure 9.7 we present the numerical result
for the concordance model. In the models without reionization the $l$-dependence of
$l(l+1)C_l^{EE}$ and $l(l+1)C_l^{BB}$ can easily be understood if we take into account that
the polarization is proportional to the quadrupole component of the temperature
fluctuations at recombination. In turn, this quadrupole component is mainly due to
the perturbations which, at this time, have scales of the order of the horizon and
smaller. Therefore, the correlation functions $l(l+1)C_l^{EE}$ and $l(l+1)C_l^{BB}$ drop
off for $l < 100$, corresponding to the superhorizon scales at recombination. We
have seen above that the amplitude of the gravitational waves, and their contribution
to the quadrupole component of the temperature fluctuations at recombination,
decreases on subhorizon scales. Hence, they do not contribute to the correlation of
the B-type polarization on angular scales corresponding to $l > 100$. As a result,
the function $l(l+1)C_l^{BB}$ reaches its maximum at $l \sim 100$. In contrast with
B-polarization, due to scalar perturbations on subhorizon scales, there are substantial
correlations of E-polarization for $l > 100$.

The correlation function $C_l^{ET}$ is the easiest to measure since it entails a
cross-correlation between the temperature fluctuation amplitude, which is large, and the
largest (E mode) polarization component. Measured at $l > 50$, $C_l^{ET}$ would supply
us with information about the history of the recombination.

The B mode polarization is an especially important object of microwave background
measurement, since it is the most decisive and probably the only realistic way
of detecting the nearly scale-invariant spectrum of gravitational waves predicted by
inflation. The cross-polarization multipoles $C_l^{BT}$ are the easiest moments to detect
if one is searching for signs of gravitational waves. The technological challenge of
detecting the B mode is considerable. We have already noted that the polarization
signal is small, but the B mode polarization component itself is a small fraction of
the total polarization in typical inflationary models, as shown in Figure 9.7. In addition,
there are foregrounds to consider. For example, the lensing of the microwave
background by foreground sources distorts the background polarization pattern in
such a way that purely E mode polarization appears to have a B mode component.
Nevertheless, present projections suggest that, during the next decade, the most
likely range of gravitational wave amplitudes predicted by inflationary cosmology
can be fully explored.


9.11 Reionization 


At late stages, when nonlinear structure begins to form in the universe, neutral 
hydrogen can be reionized. In fact, analyzing the spectra of the most distant quasars, 
one can conclude that most of the intergalactic hydrogen is ionized at z = 5. If it 
were not, the spectra would be significantly corrupted by the absorption lines of
the intergalactic neutral hydrogen. After reionization, the CMB photons can scatter 
on the free electrons and therefore late reionization affects the resulting CMB 
fluctuations. 

Let us first establish how reionization influences the spectrum of the temperature
fluctuations. The probability that the photon has avoided scatterings and propagated
freely from time $t$ until the present time $t_0$ is equal to

$$P(t) = \lim_{\Delta t \to 0} \left( 1 - \frac{\Delta t}{\tau(t)} \right) \left( 1 - \frac{\Delta t}{\tau(t + \Delta t)} \right) \cdots \left( 1 - \frac{\Delta t}{\tau(t_0)} \right)
= \exp\left( -\int_t^{t_0} \frac{dt}{\tau(t)} \right) = \exp(-\mu(t)), \tag{9.166}$$

where $\tau = (\sigma_T X n_t)^{-1}$ is the mean free time for Thomson scattering, $n_t$ is the total
number density of all electrons and $X$ is the ionization fraction. The optical depth
$\mu(t) \equiv \mu(z)$ entering (9.166) can be rewritten as an integral over the redshift parameter

$$\mu(z) = \int_t^{t_0} \sigma_T X n_t\, dt = \sigma_T \int_0^z \frac{X n_t(z)}{H(z)(1+z)}\, dz. \tag{9.167}$$


Let us assume instantaneous reionization at redshift $z_r \gg z_{ion} \gg 1$ and calculate
the optical depth $\mu(z_{ion})$ in a flat universe. The total number density of all (free and
bound) electrons is equal to

$$n_t(z) \simeq 0.88 \times \frac{\varepsilon_b(z)}{m_p} = 0.88 \times \frac{3H_0^2}{8\pi G m_p}\, \Omega_b\, (1+z)^3, \tag{9.168}$$

where the factor 0.88 accounts for the fact that about 12% of all baryons are
neutrons. At high $z$, which gives the main contribution to the integral (9.167), we
can neglect the cosmological constant in comparison with the cold matter and
use the following expression for the Hubble parameter: $H(z) \simeq H_0\, \Omega_m^{1/2} (1+z)^{3/2}$.
Substituting it with (9.168) into (9.167), and assuming that $X \simeq 1$ at $z < z_{ion}$, we
obtain

$$\mu(z_{ion}) \simeq 0.03\, \frac{\Omega_b h_{75}^2}{\sqrt{\Omega_m h_{75}^2}}\, z_{ion}^{3/2}. \tag{9.169}$$

In the concordance model, where $\Omega_b h_{75}^2 \simeq 0.04$ and $\Omega_m h_{75}^2 \simeq 0.3$, reionization
at $z_{ion} \simeq 20$ implies an optical depth $\mu \simeq 0.2$. If reionization happens at $z_{ion} \simeq 5$,
then $\mu \simeq 0.02$, and in this case the overall effect of reionization on the CMB
fluctuations does not exceed 2% or so of the total fluctuations.

As a result of reionization, the fraction

$$1 - \exp(-\mu(z_{ion}))$$

of all photons will be rescattered on the electrons. The remaining fraction
$\exp(-\mu(z_{ion}))$ will not be influenced and will give the usual contribution to the
fluctuations. For instance, in the model with $z_{ion} \simeq 20$, about 80% of all photons
will not be influenced by reionization. The contribution of the 20% of the photons
that are rescattered to the angular power spectrum depends on the multipole $l$. After
rescattering, the photon changes its direction of propagation and then the position
from which it appears to emanate can be any point that is remote from the original
scattering point at a distance not exceeding the horizon scale at this time. As a
result, the contribution of the rescattered photons is smeared out within the angular
scales corresponding to the reionization horizon scale and does not give rise to
temperature fluctuations. For those $l$ which correspond to superhorizon scales the
fluctuations will not be influenced because of causality. Of course, at $z < z_{ion}$ the
horizon scale continues to grow but the optical depth decreases and the fraction
of the photons which are rescattered drops; therefore we neglect this effect. The
angular size of the horizon at $z_{ion}$, corresponding to the conformal time $\eta_{ion}$, can
easily be found from (2.69) if we take into account that the physical size of the
reionization horizon is equal to $a(\eta_{ion})\,\eta_{ion}$ and $\chi_{em} = \eta_0 - \eta_{ion} \approx \eta_0$. Then, in
a flat universe, we get $\Delta\theta_{ion} \approx \eta_{ion}/\eta_0$ and the appropriate multipole moment is
equal to

$$l_{ion} \equiv \pi/\Delta\theta_{ion} \simeq \pi\, \eta_0/\eta_{ion} \simeq \pi\, z_{ion}^{1/2}\, \Omega_m^{0.09}, \tag{9.170}$$

where for the ratio $\eta_0/\eta_{ion}$ we have used the results of Problem 9.9. Thus, in the
model with reionization the observed temperature fluctuations are


$$C_l^{obs} = \begin{cases} C_l, & l \ll l_{ion}, \\ \exp(-\mu(z_{ion}))\, C_l, & l \gg l_{ion}, \end{cases} \tag{9.171}$$

where the intrinsic temperature fluctuations $C_l$ have been calculated in the previous
sections. If $\Omega_m \simeq 0.3$ and $z_{ion} \simeq 20$, we have $l_{ion} \simeq 12$. In this case the amplitudes
of the higher multipoles are suppressed by about 20% compared with their original
value, while the lower multipoles are untouched. As we have already mentioned, 
this can imitate the tilting of the spectral index to a certain extent and lead to an 
extra degeneracy and further cosmic confusion. 
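The numbers quoted in this section can be reproduced with a short calculation. The sketch below is an illustration under stated assumptions: it integrates (9.167) with $X = 1$ and the matter-dominated $H(z)$, restores the factor of $c$ that is set to unity in the text, uses rounded CGS values for the physical constants, and takes the multipole estimate as $l_{ion} \simeq \pi z_{ion}^{1/2}\Omega_m^{0.09}$, which is an assumed reading of (9.170). All function and parameter names are hypothetical.

```python
import math

SIGMA_T = 6.652e-25                 # Thomson cross-section, cm^2
M_P = 1.6726e-24                    # proton mass, g
G = 6.674e-8                        # Newton constant, cm^3 g^-1 s^-2
C = 2.998e10                        # speed of light, cm/s
H75 = 75.0 * 1.0e5 / 3.0857e24      # 75 km/s/Mpc expressed in s^-1

def optical_depth(z_ion, omega_b_h2=0.04, omega_m_h2=0.3, steps=2000):
    """mu(z_ion) from (9.167) with X = 1; the *_h2 arguments are Omega * h_75^2."""
    n0 = 0.88 * 3.0 * H75**2 * omega_b_h2 / (8.0 * math.pi * G * M_P)  # electrons per cm^3 today
    h0 = H75 * math.sqrt(omega_m_h2)                                    # H_0 * Omega_m^(1/2)
    dz = z_ion / steps
    total = 0.0
    for i in range(steps):                                              # midpoint rule in z
        z = (i + 0.5) * dz
        total += n0 * (1.0 + z)**3 / (h0 * (1.0 + z)**1.5 * (1.0 + z)) * dz
    return SIGMA_T * C * total

def optical_depth_closed_form(z_ion, omega_b_h2=0.04, omega_m_h2=0.3):
    """Closed-form estimate (9.169), valid for z_ion >> 1."""
    return 0.03 * omega_b_h2 / math.sqrt(omega_m_h2) * z_ion**1.5

def l_ion(z_ion, omega_m=0.3):
    """Multipole of the reionization horizon, assuming l_ion ~ pi * sqrt(z_ion) * Omega_m^0.09."""
    return math.pi * math.sqrt(z_ion) * omega_m**0.09

mu = optical_depth(20.0)
print(mu, optical_depth_closed_form(20.0))   # both close to 0.2
print(1.0 - math.exp(-mu))                   # rescattered fraction, about one fifth
print(l_ion(20.0))                           # between 12 and 13
```

The numerical integral keeps the full $(1+z)^{1/2}$ integrand, whereas the closed form drops terms that are small for $z_{ion} \gg 1$; this is why the two estimates differ by several percent at $z_{ion} = 20$ and by more at $z_{ion} = 5$.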

This degeneracy can be easily resolved if we consider the influence of the reionization
on polarization spectra. In fact, reionization leads to distinct features in
these spectra. The temperature fluctuations in the scattered fraction of photons are
not completely washed out on the scales corresponding to the reionization horizon.
As a result, there is a net contribution of the rescattered photons to the total
temperature fluctuations for the multipoles $l \sim l_{ion}$, and it is polarized. It is obvious
that the extra polarization induced by reionization is proportional to the fraction
of the rescattered photons, $1 - \exp(-\mu(z_{ion}))$, and to the quadrupole anisotropy of
the rescattered photons at the beginning of reionization. Because this quadrupole
anisotropy is mostly due to the perturbations with scales of order of the reionization
horizon, an extra contribution to the polarization correlation functions should have
a local maximum at $l \sim l_{ion}$. This explains the behavior of the correlation functions
in Figure 9.7 where the results for the polarization in the concordance model
with $z_{ion} \simeq 20$ are presented. We would like to stress that because of the presence
of the long-wavelength gravitational waves, both E and B modes of polarization
will be generated. Thus, we see how measuring the polarization at low multipoles
can reveal details of the reionization history and help us to resolve the degeneracy
problem.